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CELL CYCLE AND PROLIFERATION PROTEINS 

TECHNICAL FIELD 1 

This invention relates to nucleic acid and amino acid sequences of cell cycle and proliferation 
5 : proteins and to the use of these sequences in the diagnosis, treatment, and prevention of immune, 
developmental, and cell signaling disorders, and cell proliferative disorders including cancer. < 

BACKGROUND OF THE INVENTION 

Cell division is the fundamental process by which all living things grow and reproduce. In 
0 unicellular organisms such as yeast and bacteria, each cell division doubles the number of organisms, 
while in multicellular species many rounds of cell division are required to replace cells lost by wear or 
by programmed cell death, and for ceil differentiation to produce a new tissue or organ. Details of the 
cell division cycle may vary, but the basic process consists of three principal events. The first event, 
interphase, involves preparations for cell division, replication of the DNA, and production of essential 
5 proteins. In the second event, mitosis, the nuclear material is divided and separates to opposite sides of 
the cell. The final event, cytokinesis, is division and fission of the cell cytoplasm. The sequence and 
timing of cell cycle transitions are under the control of the cell cycle regulation system which controls 
the process by positive or negative regulatory circuits at various check points. 

Mitosis marks the end of interphase and concludes with the onset of cytokinesis. There are four 
stages in mitosis, occurring in the following order: prophase, metaphase, anaphase and telophase 
Prophase includes the formation of bi-polar mitotic spindles, composed of mictrotubulcs and associated 
proteins such as dynein, which originate from polar mitotic centers. During metaphase, the nuclear 
material condenses and develops kinetochore fibers which aid in its physical attachment to the mitotic 
spindles. The ensuing movement of the nuclear material to opposite poles along the mitotic spindles 
occurs during anaphase. Telophase includes the disappearance of the mitotic spindles and kinetochore 
fibers from the nuclear material. Mitosis depends on the interaction of numerous proteins. For 
example, mutation studies in the Drosophila melanogaster zwW gene show a disruption in chromosome 
segregation. ZW10 protein appears to function at the kinetochore as a tension-sensing checkpoint 
during the onset of anaphase. ZW1 0 appears to have a direct role in the recruitment of dynein to the 
kinetochore, and, dyncin's involvement in the coordination of chromosome separation at the onset of 
anaphase and/or poleward movement (Starr, D.A. et al. (1998) J. Cell Biol. 142:763-774). 

Regulated progression of the cell cycle depends on the integration of growth control pathways 
with the basic cell cycle machinery. Cell cycle regulators have been identified by selecting for human 
and yeast cDNAs that block or activate cell cycle arrest signals in the yeast mating pheromone pathway 
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when they are ovcrexpressed. Known regulators include human CPR (cell cycle progression 
restoration) genes, such as CPR8 and CPR2, and yeast CDC (cell division control) genes, including 
CDC91, that block the arrest signals. The CPR genes express a variety of proteins including cyclins, 
tumor suppressor binding proteins, chaperones, transcription factors, translation factors, and 
RNA-binding proteins (Edwards, M.C. et al. (1997) Genetics 147:1063-1076). : 

The human CDC protein, CDC23, is homologous to the S. cerevisiae protein CDC23 which 
functions in the transition from metaphase to anaphase as well as in the exit from mitosis (Zhao, N. et 
al. (1998) Genomics 53:184-190). The C elegans gene cullin-1 (cull) is a negative regulator of the cell 
cycle, cull regulates the Gl to S phase transition and C. elegans cull mutants exhibit hyperplasia of all 
tissues through acceleration of this transition by overriding mitotic arrest, cull is a member of a 
conserved gene family that spans S. cerevisiae , nematodes and humans (Kipreos, E.T. et al. (1996) Cell 
85:929-839). 

Several cell cycle transitions, including the entry and exit of a cell from mitosis, are dependent 
upon the activation and inhibition of cyclin-dependent kinases (Cdks). The Cdks are composed of a 
kinase subunit, Cdk, and an activating subunit, cyclin, in a complex that is subject to many levels of 
regulation. There appears to be a single Cdk in Saccharomvces cerevisiae and Schizosaccharomyces 
pombe whereas mammals have a variety of specialized Cdks. Cyclins act by binding to and activating 
cyclin-dependent protein kinases which then phosphorylate and activate selected proteins involved in the 
mitotic process. The Cdk-cyclin complex is both positively and negatively regulated by 
phosphorylation, and by targeted degradation involving molecules such as CDC4 and CDC53. In 
addition, Cdks are further regulated by binding to inhibitors and other proteins such as Sucl that 
modify their specificity or accessibility to regulators (Patra, D. and W.G. Dunphy (1996) Genes Dev. 
10:1503-1515; and Mathias, N, et al. (1996) Mol. Cell Biol. 16:6634-6643). 
Reproduction 

The male and female reproductive systems are complex and involve many aspects of growth 
and development. The anatomy and physiology of the male and female reproductive systems are 
reviewed in Guyton, A.C. ((1991) Textbook of Medical Physiology , W.B. Saunders Co., Philadelphia 
PA, pp.899-928). 

The male reproductive system includes the process of spermatogenesis, in which the sperm are 
formed. Male reproductive functions are regulated by various hormones. The hormones exert their 
effects on accessory sexual organs, and are involved in cellular metabolism, growth, and other bodily 
functions. 

Spermatogenesis begins at puberty as a result of stimulation by gonadotropic hormones 
released from the anterior pituitary. Immature sperm (spermatogonia) undergo several mitotic cell 
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divisions before undergoing meiosis and full maturatioa The testes secrete several male sex hormones. 
Testosterone, the most abundant, is essential for growth and division of the immature sperm, and for the 
masculine characteristics of the male body. Three other male sex hormones, gonadotropin-releasing 
hormone (GnRH), luteinizing hormone (LH), and follicle-stimulating hormone (FSH), control sexual 
function. 

The uterus, ovaries, fallopian tubes, vagina, and breasts comprise the female reproductive 
system. The ovaries and uterus are the source of ova 1 and the location of fetal development, 
respectively. The fallopian tubes and vagina are accessory organs attached to the top and bottom of the 
uterus, respectively. Both the uterus and ovaries have additional roles in the development and loss of 
reproductive capability during a female's lifetime. The primary role of the breasts is lactation. 
Multiple endocrine signals from the ovaries, uterus, pituitary, hypothalamus, adrenal glands, and other 
tissues coordinate reproduction and lactation. These signals vary during the monthly menstruation 
cycle and during the female's lifetime. Similarly, the sensitivity of reproductive organs to these 
endocrine signals varies during the female's lifetime. 

A combination of positive and negative feedback to the ovaries, pituitary and hypothalamus 
glands controls physiologic changes during the monthly ovulation and endometrial cycles. The anterior 
pituitary secretes two major gonadotropin hormones, follicle-stimulating hormone (FSH) and luteinizing 
hormone (LH), regulated by negative feedback of steroids, most notably by ovarian estradiol. If 
fertilization does not occur, estrogen and progesterone levels decrease. This sudden reduction of the 
ovarian hormones leads to menstruation, the desquamation of the endometrium. 

Hormones further govern all the steps of pregnancy, parturition, lactation, and menopause. 
During pregnancy large quantities of human chorionic gonadotropin (hCG), estrogens, progesterone, 
and human chorionic somatomammotropin (hCS) arc formed by the placenta. hCG, a glycoprotein 
similar to luteinizing hormone, stimulates the corpus luteum to continue producing more progesterone 
and estrogens, rather than to involute as occurs if the ovum is not fertilized. hCS is similar to growth 
hormone and is crucial for fetal nutrition. 

The female breast also matures during pregnancy. Large amounts of estrogen secreted by the 
placenta trigger growth and branching of the breast milk ductal system while lactation is initiated by the 
secretion of prolactin by the pituitary gland. 

Parturition involves several hormonal changes that increase uterine contractility toward the end 
of pregnancy, as follows. The levels of estrogens increase more than those of progesterone. Oxytocin 
is secreted by the neurohypophysis. Concomitantly, uterine sensitivity to oxytocin increases. The fetus 
itself secretes oxytocin, Cortisol (from adrenal glands), and prostaglandins. 

Menopause occurs when most of the ovarian follicles have degenerated. The ovary then 
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prcxluces less estradiol, reducing the negative feedback on the pituitary and hypothalamus glands. 
Mean levels of circulating FSH and LH increase, even as ovulatory cycles continue. Therefore, the 
ovary is less responsive to gonadotropins, and there is an increase in the time between menstrual cycles. 
Consequently, menstrual bleeding ceases, and reproductive capability ends. 

5 Differentiation and Proliferation 

Tissue growth involves complex and ordered patterns of cell proliferation, cell differentiation, 
arid apoptosis. Cell proliferation must be regulated to maintain both the number of cells and their 
spatial organization. This regulation depends upon the appropriate expression of proteins^ which control 
cell cycle progression in response to extracellular signals, such as growth factors and other mitogens, 

10 and intracellular cues, such as DN A damage or nutrient starvation. Molecules which directly or 

indirectly modulate cell cycle progression fall into several categories, including growth factors and their 
receptors, second messenger and signal transduction proteins, oncogene products, tumor-suppressor 
proteins, and mitosis-promoting factors. 

Embryogenesis is a process in which distinct patterns of protein expression control proper 

15 development. This process involves a host of proteins each with distinct and highly coordinated 

expression patterns. For example, in the mouse, temporally regulated expression of two related genes 
Msgl and Mrgl contribute to normal embryonic development Msgl is expressed in the posterior 
domains of the developing mesoderm, while Mrgl is expressed in the anterior visceral endoderm. 
Properly coordinated expression of each protein throughout embryogenesis is critical for proper tissue 

20 and organ formation (Dunwoodie, S.L. et ah (1998) Mech. Dev. 72:27-40). 

Growth factors were originally described as serum factors required to promote cell 
proliferation. Most growth factors are large, secreted polypeptides that act on cells in their local 
environment. Growth factors bind to and activate specific cell surface receptors and initiate 
intracellular signal transduction cascades. Many growth factor receptors are classified as receptor 

25 tyrosine kinases which undergo autophosphorylation upon ligand binding. Autophosphorylation enables 
the receptor to interact with signal transduction proteins characterized by the presence of SH2 or SH3 
domains (Src homology regions 2 or 3) . These proteins then modulate the activity state of small G- 
proteins, such as Ras, Rab, and Rho, along with GTPase activating proteins (GAPs), guanine 
nucleotide releasing proteins (GNRPs), and other guanine nucleotide exchange factors. Small G 

30 proteins act as molecular switches that activate other downstream events, such as mitogen-activated 
protein kinase (MAP kinase) cascades. MAP kinases ultimately activate transcription of mitosis- 
promoting genes. 

In addition to growth factors, small signaling peptides and hormones also influence cell 
proliferation. These molecules bind primarily to another class of receptor, the trimeric G-protein 
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coupled receptor (GPCR), found predominantly on the surface of immune, neuronal and neuroendocrine 
cells. Upon ligand binding, the GPCR activates a trimeric G protein which in turn triggers increased 
levels of intracellular second messengers such as phospholipase C, Ca 2 *, and eyelid AMP. Most 
GPCR-mediated signaling pathways indirectly promote cell proliferation by causing the secretion or 1 
breakdown of other signaling molecules that have direct mitogenic effects. These signaling cascades 
often involve activation of kinases and phosphatases. Some growth factors^ such as some members of 
the transforming growth factor beta (TGF-p) family, act on some cells to stimulate cell proliferation 
and on other cells to inhibit it Growth factors may also stimulate a cell at one concentration and inhibit 
the same cell at another concentration. Most growth factors also have a multitude of other actions 
besides the regulation of cell growth and di vision: they can control, the proliferation, survival, * 
differentiation, migration, or function of cells depending on the circumstance. For example, the tumor 
necrosis factor/nerve growth factor (TNF/NGF) family can activate or inhibit cell death, as well as 
regulate proliferation and differentiation. The cell response depends on the type of cell, its stage of 
differentiation and transformation status, which surface receptors are stimulated, and the types of 
stimuli acting on the cell (Smith, A. et al. (1994) Cell 76:959-962; and Nocentini, G. et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:6216-6221). 

Neighboring cells in a tissue compete for growth factors, and when provided with "unlimited" 
quantities in a perfused system will grow to even higher cell densities before reaching density-dependent 
inhibition of cell division. Cells often demonstrate an anchorage dependence of cell division as well. 
This anchorage dependence may be associated with the formation of focal contacts linking the 
cytoskelcton with the extracellular matrix (ECM). The expression of ECM components can be 
stimulated by growth factors. For example, TGF-P stimulates fibroblasts to produce a variety of ECM 
proteins, including fibronectin, collagen, and tenascin (Pearson, C.A. et al. (1988) EMBO J. 7:2977- 
2981). In fact, for some cell types, specific ECM molecules, such as laminin or fibronectin, may act as 
growth factors. Tenascin-C and -R, expressed in developing and lesioned neural tissue, provide 
sumulatory/anti-adhesive or inhibitory properties, respectively, for axonal growth (Faissner, A. (1997) 
Cell Tissue Res. 290:33 1-341). 

Cancers and immune disorders are characterized by uncoordinated cell proliferation. Cancers 
are associated with the activation of oncogenes which are derived from normal cellular genes. These 
oncogenes encode oncoproteins which convert normal cells into malignant cells. Some oncoproteins are 
mutant isoforms of the normal protein, and other oncoproteins are abnormally expressed with respect to 
location or amount of expressioa The latter category of oncoprotein causes cancer by altering 
transcriptional control of cell proliferation. Five classes of oncoproteins are known to affect cell cycle 
controls. These classes include growth factors, growth factor receptors, intracellular signal 
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transducers, nuclear transcription factors, and cell-cycle control proteins. Viral oncogenes are 
integrated into the human genome after infection of human cells by certain viruses. Examples of viral 
oncogenes include v-src, v-abl, and v-fps. Certain cell proliferation disorders can be identified by 
changes in the protein complexes that normally control progression through the cell cycle. A primary 
5 treatment strategy involves reestablishing control over cell cycle progression by manipulation of the 
proteins involved in cell cycle regulation (Nigg, E.A. (1995) BioEssays 17:471-480). 

Many oncogenes have been identified and characterized These include sis, erbA, erbB, her-2, 
mutated G 8 , src, abl, ras, crk, jun, fos, myc, and mutated tumor-suppressor genes such as RB, p53, ; 
mdm2, Cipl, pi 6, and cyclin D. Transformation of normal genes to oncogenes may also occur by 

10 chromosomal translocation. The Philadelphia chromosome, characteristic of chronic myeloid leukemia 
and a subset of acute lymphoblastic leukemias, results from a reciprocal translocation between 
chromosomes 9 and 22 that moves a truncated portion of the proto-oncogene c-abl to the breakpoint 
cluster region (bcr) on chromosome 22, 

Mutations which hyperactivate oncogenes result in cell proliferation. Stimulation of a cell by 

15 growth factors activates two sets of gene products, the early-response genes and the delayed-response 
genes. Early-response gene products include myc, fos, and jun, all of which encode gene regulatory 
proteins. These regulatory proteins lead to the transcriptional activation of a second set of genes, the 
delayed-response genes, which include the cell-cycle regulators Cdk and cyclins. For example, the 
human T-cell leukemia virus type I (HTLV-1) Tax transact! vator protein acts as an early response gene 

20 by enhancing the activity of a cellular transcription factor. The oncogenic properties of the Tax protein 
include transformation of primary T-lymphocytes and fibroblasts through cooperation with the a GTP- 
binding protein, Ras. Recently investigators have shown that Tax interacts with several PDZ- 
containing proteins. The PDZ domain, originally described in the Drosophila tumor suppressor protein 
Discs-Large, is common to membrane proteins thought to be involved in clustering receptors in growth 

25 factor signal transduction pathways (Rousset, R. et al. (1998) Oncogene 16:643-654). 

Tumor-suppressor genes are involved in regulating cell proliferation. Mutations which cause 
reduced or loss of function in tumor-suppressor genes result in uncontrolled cell proliferation. For 
example, the retinoblastoma gene product (RB), in a non-phosphorylated state, binds several early- 
response genes and suppresses their transcription, thus blocking cell division. Phosphorylation of RB 

30 causes it to dissociate from the genes, releasing the suppression, and allowing cell division to proceed. 

Other gene products involved in cell proliferation, differentiation, and apoptosis are yet to be 
discovered. One method currently being utilized to help identify such new molecules involves 
comparisons between quiescent and proliferative tissues. For example, a subtractive hybridization 
screen of human placental cytotrophoblast cells identified 20 genes whose expression levels rose due to 
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EGF induction of cell proliferatioa (Morrish, D.W. et al. (1996) Placenta 17:431-441). Another 
method involves identification of molecules produced in cells treated with ahti-tumorigenic agents, such 
as dithiolethiones. Presumably, the protective action of these anti-tumorigenic agents is associated with 
the induction of tumor suppressor gene products (Primiano, T. et al. .(1996) Carcinogenesis 17:2297- 
5 2303,). , , 

In another example, the candidate tumor-suppressor gene ING1, that codes a nuclear protein, 
p33ING 1 , is involved in the negative regulation of cell proliferation. The action of p33ING 1 is 
dependent upon the activity of another tumor-suppressor gene, p53. p53 is * cellular stress-responsive 
gene requiring the activity of p33INGl to effecUvely induce growth inhibition of cells. p33INGl "and 
10 p53 have been shown to physically associate through immunoprecipitation studies (Garkavtsev* I. et al. 
(1998) Nature 391 :295-298)/ 

Apoptosis i 

Apoptosis is the genetically controlled process by which unnceded or defective cells undergo 
programmed cell death. Selective elimination of cells is as important for morphogenesis and tissue 

15 remodeling as is cell proliferation and differentiation. Lack of apoptosis may result in hyperplasia and 
other disorders associated with increased cell proliferation. Apoptosis is also a critical component of 
the immune response. Immune cells such as cytotoxic T-cells and natural killer cells prevent the spread 
of disease by inducing apoptosis in tumor cells and virus-infected cells. In addition, immune cells that 
fail to distinguish self molecules from foreign molecules must be eliminated by apoptosis to avoid an 

20 autoimmune response. 

Apoptotic cells undergo distinct morphological changes. Hallmarks of apoptosis include cell 
shrinkage, nuclear and cytoplasmic condensation, and alterations in plasma membrane topology. 
Biochemically, apoptotic cells are characterized by increased intracellular calcium concentration, 
fragmentation of chromosomal DNA, and expression of novel cell surface components. 

25 The molecular mechanisms of apoptosis are highly coaserved, and many of the key protein 

regulators and effectors of apoptosis have been identified. Apoptosis generally proceeds in response to 
a signal which is transduced intracellular^ and results in altered patterns of gene expression and protein 
activity. Signaling molecules such as hormones and cytokines are known both to stimulate and to 
inhibit apoptosis through interactions with cell surface receptors. Transcription factors also play an 

30 important role in the onset of apoptosis. A number of downstream effector molecules, particularly 
proteases such as the cysteine proteases called caspases, have been implicated in the degradation of 
cellular components and the proteolytic activation of other apoptotic effectors. 
Aging and Senescence 

Studies of the aging process or senescence have shown a number of characteristic cellular and 
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molecular changes (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine . McGraw-Hill, 
New York NY, p.37). These characteristics include increases in chromosome structural abnormalities, 
DNA cross-linking, incidence of single-stranded breaks in DNA, losses in DNA methylation, and 
degradation of telomere regions. In addition to these DNA changes, post-translational alterations of 
5 proteins increase including deamidation, oxidation, cross-linking, and nonenzymatic glycosylation. Still 
further molecular changes occur in the mitochondria of aging cells through deterioration of structure. 
These changes eventually contribute to decreased function in every organ of the body. 

The discovery of new cell cycle and proliferation proteins and the polynucleotides encoding 
them satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
10 prevention, and treatment of immune, developmental, and cell signaling disorders, and cell proliferative 
disorders including cancer. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, cell cycle and proliferation proteins, referred to 

15 collectively as "CCYPR" and individually as "CCYPR-1," "CCYPR-2," "CCYPR-3," "CCYPR-4," 
"CCYPR-5," "CCYPR-6," "CCYPR-7," "CCYPR-8," "CCYPR-9," "CCYPR-10," "CCYPR-11," 
"CCYPR- 12," "CCYPR- 13," "CCYPR-14," "CCYPR-15," "CCYPR-16," "CCYPR-17," "CCYPR- 
18," "CCYPR-19," "CCYPR-20," "CCYPR-21," "CCYPR-22," "CCYPR-23," 4 CCYPR-24," 
"CCYPR-25," "CCYPR-26," "CCYPR-27," "CCYPR-28," "CCYPR-29," "CCYPR-30," "CCYPR- 

20 31," "CCYPR-32," "CCYPR-33," "CCYPR- 34," "CCYPR-35 "CCYPR-36," "CCYPR-37," 

"CCYPR-38," "CCYPR-39," "CCYPR-40," "CCYPR-41 ," "CCYPR-42," "CCYPR-43," "CCYPR- 
44," "CCYPR-45," "CCYPR-46," "CCYPR-47," "CCYPR-48," "CCYPR-49," "CCYPR-5G," 
"CCYPR-51," "CCYPR-52," "CCYPR-53," "CCYPR-54." In one aspect, the invention provides an 
isolated polypeptide comprising an amino acid sequence selected from the group consisting of a) an 

25 amino acid sequence selected from the group consisting of SEQ ID NO: 1-54, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-54, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54. In one alternative, the invention 

30 provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:l -54. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-54, b) a naturally occurring amino acid sequence having at least 
90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
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54, c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-54, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-54. In one alternative, the polynucleotide encodes a polypeptide selected 
from the group consisting of SEQ ID NO: 1-54. In another alternative, the polynucleotide is selected 
5 from the group consisting of SEQ ID Nd:55-108. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-54, b) a naturally occurring amino acid sequence having at least 90% 

10 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l -54, c) 
a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-54, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-54. In one alternative, the invention provides a cell transformed with the 
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 

1 5 comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-54, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-54, c) 

20 a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-54, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-54. The method comprises a) culturing a cell under conditions suitable for 
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide 
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 

25 recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-54, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

30 consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l -54, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-54. 

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
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SEQ ID NO:55-108, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:55-108, c) a 
polynucleotide sequence complementary to a) f d) a polynucleotide sequence complementary to b) f and e) 
an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
5 nucleotides., 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO.55-108, b) a naturally occurring polynucleotide sequence having at least 70% sequence 

10 identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:55-108, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions 

15 whereby a hybridization complex is formed between said probe and said target polynucleotide or 
fragments thereof, and b) detecting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 
20 target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:55-108, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:55-108, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to h), and e) 
25 an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 
fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the 
amount thereof. 

The invention further provides a pharmaceutical composition comprising an effective amount 
30 of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO:l -54, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-54, and d) an immunogenic fragment of an amino acid 
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sequence selected from the group consisting of SEQ ID NO:l-54 t and a pharmaceutical] y acceptable 

excipienL In one embodiment, the pharmaceutical composition comprises an amino acid sequence 

, • - i r 

selected from the group consisting of SEQ ID NO:l-54. The invention additionally provides a method 

of treating, a disease or condition associated with decreased expression of functional CCYPR, 

5 comprising administering to a patient in need of such treatment the pharmaceutical composition. 

■ The invention also provides a method for screening a compound for effectiveness as an 

agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 

amino acid sequence selected from the group consisting &f SEQ ID NO: 1-54, b) a naturally occurrihg 

amino acid sequence having at least 90% sequence identity to an amino acid sequence selected irom the 

10 group consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-54, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the 
sample. In one alternative, the invention provides a pharmaceutical composition comprising an agonist 

15 compound identified by the method and a pharmaceutical! y acceptable excipient. In another 
alternative, the invention provides a method of treating a disease or condition associated with 
decreased expression of functional CCYPR, comprising administering to a patient in need of such 
treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 

20 an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -54, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54, and d) an immunogenic fragment 

25 of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-54. The method 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an antagonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alto-native, the invention provides a method of treating a disease or 

30 condition associated with overexpression of functional CCYPR, comprising administering to a patient 
in need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54, b) a naturally occurring amino 

35 acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
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consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-54, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-54. The method comprises a) combining 
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
5 of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-54, b) a naturally 

10 * occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-54, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-54, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l-54. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 

15 permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 

presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

20 The invention further provides a method for screening a compound for effectiveness in 

altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:55-108, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

25 The invention further provides a method for assessing toxicity of a test compound, said 

method comprising a) treating a biological sample containing nucleic acids with the test compound; , 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID 

30 NO:55-108, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity 
to a polynucleotide sequence selected from the group consisting of SEQ ID NO:55-108, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 

35 sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
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consisting of SEQ TD NO:55-108, ii) a naturally occurring polynucleotide sequence having at least 
70% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:55-108, iii) a polynucleotide sequence complementary to i), iv) a polynucleotide sequence 
complementary to ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide 
5 comprises a fragment of the above polynucleotide sequence; c) quantifying the amount of 
hybridization complex; and d) comparing the amount of hybridization complex in the treated 

biological sample with the amount of hybridization complex in an untreated biological sample, 

w * '" • 

whereina difference in the amount of hybridization complex in the treated biological sample is 

indicative of toxicity of the test compound. 

10 - = v ; - ' ■: 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding CCYPR. 
15 Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 

sequences, and methods, algorithms, and searchable databases used for analysis of CCYPR. 

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 
patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned 
20 Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 

encoding CCYPR were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

25 DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 

30 be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 

35 forth. 
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Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described 
5 All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

10 "CCYPR" refers to the amino acid sequences of substantially purified CCYPR obtained from 

any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
CCYPR. Agonists may include proteins,, nucleic acids, carbohydrates, small molecules, or any other 

1 5 compound or composition which modulates the activity of CCYPR either by directly interacting with 
CCYPR or by acting on components of the biological pathway in which CCYPR participates. 

An "allelic variant" is an alternative form of the gene encoding CCYPR. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered A gene may have none, one, or 

20 many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times in 
a given sequence. 

"Altered" nucleic acid sequences encoding CCYPR include those sequences with deletions,. 

25 insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as CCYPR or a 
polypeptide with at least one functional characteristic of CCYPR. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding CCYPR, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding CCYPR. 

30 The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
amino acid residues which produce a silent change and result in a functionally equivalent CCYPR. 
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological activity of CCYPR is retained. For example, negatively charged amino 
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acids may include aspartic acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
values may include: asparagine and glutamine; and serine and threonine. Amino acids with 
uncharged side chaips having similar hydrophilicity values may include: leucine, isoleucine, and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

_ The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Whore "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein^molecule, "amino acid sequence" and likeWms are not meant to limit the amino acid sequence 
to the complete native amino acid-sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 
in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
CCYPR. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of CCYPR either by 
directly interacting with CCYPR or by acting on components of the biological pathway in which 
CCYPR participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind CCYPR polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DN A; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
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phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -methoxyethyl sugars or 2 -methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5 -methyl cytosine, 2 -deoxyuracil, or 7-deaza-2 -deoxyguanosine. . Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 

5 introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 

10 functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic CCYPR, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
15 sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3-TCA-5*. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 

20 Compositions comprising polynucleotide sequences encoding CCYPR or fragments of CC YPR may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

25 "Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 

DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5' and/or the 3* direction, and resequence^ or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 

30 (University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The tabic below shows amino 
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10 



15 



20 



25 



30 



35 



40 



acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 

Original Residue - ' 



Ala 

Arg 

Asn 

Asp 

Cys 

Gin 

Glu 

Gly 1 

His 

He 

Leu 

Lys 

Met 

Phc 

Scr 

Thr 

Trp 

Tyr 

Val 



Conservative Substitution 
Gly, Ser 
His, Lys 
Asp, Gin, His 
Asn, Glu 

Ala, Ser ( 
Asn, Glu, His; 
Asp, Gin, His 
Ala 

Asn, Arg, Gin, Glu 
Leu, Val, 
lie, Val 
Arg, Gin, Glu 
Leu, lie 

His, Met, Leu, Trp, Tyr 

Cys, Thr 

Ser, Val 

Phe, Tyr 

His, Phe, Trp 

fie, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of CCYPR or the polynucleotide encoding CCYPR which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
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used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
5 from the first 250 or 500 amino acids (or first 25% of 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths arc exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:55-108 comprises a region of unique polynucleotide sequence that 
10 specifically identifies SEQ ID NO:55-108, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:55- 1 08 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:55-108 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:55-108 and the region of SEQ ID NO:55-108 to which the fragment corresponds are routinely 
15 determinable by one of ordinary skill in the art based on the intended purpose for the fragment 
A fragment of SEQ ID NO: 1-54 is encoded by a fragment of SEQ ID NO.55-108. A 
fragment of SEQ ID NO: 1-54 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO: 1-54. For example, a fragment of SEQ ID NO: 1-54 is useful as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1-54. 
20 The precise length of a fragment of SEQ ID NO:l-54 and the region of SEQ ID NO:l-54 to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the 
intended purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full- 
25 length*' polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity/' as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
30 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence 
35 alignment program. This program is part of the LASERGENE software package, a suite of molecular 
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biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 

3 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pair^ise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
5 selected as the default. ^ Percent identity is reported by CLUSTAL V as the "percent similarity" between 

aligned polynucleotide seguences. - ' 

( " t t 1 

Alternatively, a suite d commonly used and freely .available sequence compariso^algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
10 sources, including the NCBI, Bethesda, MD, and on the Internet at 

mtp^/ww.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at httpy/ww.ncbi.idm.nih.gov/gQrf^l2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
20 Matrix: BLOSUM62 

Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
25 Expect: 10 

Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
30 the length of a fragment taken from a larger, defined sequence, for instance, a fragment of al least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 
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Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 
5 The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 

the percentage of residue matches between at least two polypeptide sequences aligned using a- 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and hydrophobic! ty at the site of 
10 substitution, thus preserving the structure (and therefore Junction) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUST AL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUST AL V, the default parameters are set as follows: Ktuple=l , gap penalty=3, window=5, and 
15 "diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is repeated by CLUST AL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
20 (Apr-21 -2000) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: J J and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
25 Word Size: 3 

Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
30 a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (H ACs) are linear microchromosomes which may contain 
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DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the dements required for 
chromosome replication, segregation, and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen bihding regionsjias been altered so that the antibody more closely 
. resembles a huinan antibody, and. still retains its original binding ability. * * . 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing unto defined hybridization conditions. ^Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes fopn under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1 % (w/v) 
SDS, and about 100 ng/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 1989, Molecular 
Cloning: A Laboratory Manual, 2 nd ed, vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridizatioa Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 pg/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
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conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

* The term "hybridization complex'* refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
5 complex may be formed in solution (e.g. f C 0 t or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
or their nucleic acids have been fixc$). ' ' 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
10 resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 
15 An "immunogenic fragment" is a polypeptide or oligopeptide fragment of CCYPR which is 

capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
CCYPR which is useful in any of the antibody production methods disclosed herein or known in the art. 
The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
20 or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of CCYPR. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
25 functional, or immunological properties of CCYPR. 

The phrases "nucleic acid" and 4, nucieic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 
30 "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 
35 "Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
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comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
/ may be pegylated to extend their lifespan in thecal. , 
5 "Pust-translational modification" ot an CCYPR may involye lipidation, glycosylation, 

phosphorylation,, acetylation, racemization, proteolytic cleavage, ^and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu of CCYPR. f ' 

"Probe" refers to nucleic acid sequences encoding CCyPR, their complements, or fragments 

10 thereof, which are u$ed to detect identical, allelic or rielated nucleic acid sequences. Probes are 

isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 

15 polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 

20 or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual. 2 nd ed., vol. 1-3, Cold 

25 Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Current Protocols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al., 1990, PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego C A PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 

30 MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
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programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
5 program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 

Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The' source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 

10 (available to the public from the UK Hunian Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 

15 above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
20 that is made by an artificial combination of two or more otherwise separated segments of sequence. 

This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 

artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 

such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 

been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
25 recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 

Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 
Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 

expressed, inducing a protective immunological response in the mammal. 
30 A "regulatory element** refers to a nucleic acid sequence usually derived from untranslated 

regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions (UTRs). 

Regulatory elements interact with host or viral proteins which control transcription, translation, or RN A 

stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
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amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemilumincscent, or chromogenic agents; substrates; cof^ctors; inhibitors; magnetic particles; and 

other moieties known in the art. " 

\ . V if-" * . . 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

5 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 

nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 

instead of deoxyribose. • , 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

, acids encoding CCYPR, or fragments thereof, or CCYPR itself, may comprise a bodily fluid; an extract 

10 from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 

cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

synthetic binding composition. The interaction is dependent upon the presence of a particular structure 

15 of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 

example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 

A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 

reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

20 removed from their natural environment and are isolated or separated, and are at least 60% free, 

preferably at least 75% free, and most preferably at least 90% free from other components with which 

they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 

different amino acid residues or nucleotides, respectively. 

25 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 

microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 

trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 

30 or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 

cell. Transformation may occur under natural or artificial conditions according to various methods well 

known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 

into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
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of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term "traasformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 

5 cells which express thb inserted DNA or RN A for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 

10 of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 

15 introduced into the host by methods known in the art, for example infection, transfection, 

transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms arc widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
20 least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" 

25 (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 

30 polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

35 A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
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least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show,for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence * ( 
5 identity over a certain defined length of oiie of the polypeptides. 

THE INVENTION t , ; ' 

The invention is based oh the discovery of new human cell cycle and proliferation proteins 
(CCYPR), the polynucleotides encoding CCYPR, and the use of these compositions for the diagnosis, 
treatment, or prevention of immune, developmental, and cell signaling disorders, and cell proliferative ^ 

10 disorders including cancer. . ' 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
CCYPR. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
nucleic acids encoding each CCYPR were identified, and column 4 shows the cDNA libraries from 

15 which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA 

libraries. Clones for which cDN A libraries are not indicated were derived from pooled cDNA libraries. 
In some cases, GcnBank sequence identifiers arc also shown in column 5. The Incyte clones and 
GenBank cDNA sequences, where indicated, in column 5 were used to assemble the consensus 
nucleotide sequence of each CCYPR and are useful as fragments in hybridization technologies. 

20 The columns of Table 2 show various properties of each of the polypeptides of the invention: 

column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis along with relevant citations, all of 

25 which are expressly incorporated by reference herein in their entirety; and column 7 shows analytical 
methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
30 associated with nucleotide sequences encoding CCYPR. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:55-108 and 
to distinguish between SEQ ID NO:55-108 and related polynucleotide sequences. The polypeptides 
encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
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categories which express CC YPR as a fraction of total tissues expressing CCYPR. Column 4 lists 
diseases, disorders, or conditions associated with those tissues expressing CCYPR as a fraction of total 
tissues expressing CCYPR. Column 5 lists the vectors used to subclone each cDNA library. Of 
particular note is the expression of SEQ ID NO:66 in inflammatory tissues. It should be noted that 

5 SEQ ID NO:76 was found to be expressed predominantly in nervous tissue. 

• The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding CCYPR were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

10 SEQ ID NO:61 maps to chromosome 5 within the interval from 141.40 to 142.60 

centiMorgans. This interval also contains gene(s) and/or EST(s) associated with corneal dystrophy 
and deafness. 

SEQ ID NO:73 maps to chromosome 2 within the interval from 73.80 to 83.50 centiMorgans. 
This interval also contains gene(s) and/or EST(s) associated with hereditary nonpolyposis colorectal 

15 carcinoma and Muir-Torre syndrome. SEQ ID NO:74 maps to chromosome 19 within the interval 
from 41.70 to 58.70 centiMorgans. SEQ ID NO:75 maps to chromosome 17 within the interval from 
62.90 to 64.20 centiMorgans. This interval also contains gene(s) and/or EST(s) located within the 
human breast cancer (BRCA1) gene region. SEQ ID NO:76 maps to chromosome 1 within the 
interval from 143.30 to 153.90 centiMorgans, to chromosome 3 within the interval from 156.20 to 

20 160.00 centiMorgans, and to chromosome X within the interval from 1 12.80 to 139.40 centiMorgans. 
The interval on chromosome X from 1 12.80 to 139.40 centiMorgans also contains gene(s) and/or 
EST(s) associated with X-linked agammaglobulinaemia. 

SEQ ID NO:77 maps to chromosome 23 within the interval from 173.60 to 179.80 
centiMorgans, and to chromosome 11 within the interval from 136.90 centiMorgans to q-terminus. 

25 SEQ ID NO:78 maps to chromosome 3 within the interval from 200.00 to 21 3.70 centiMorgans. 
SEQ ID NO:81 maps to chromosome 7 within the interval from 167.60 centiMorgans to q-terminus. 
SEQ ID NO:90 maps to chromosome 2 within the interval from 236.10 to 240.20 centiMorgans, to 
chromosome 3 within the interval from 16.50 to 43.00 centiMorgans, and to chromosome 6 within 
the interval from 124.20 to 126.50 centiMorgans. SEQ ID NO:91 maps to chromosome 2 within the 

30 interval from 22.40 to 40.70 centiMorgans. SEQ ID NO:98 maps to chromosome 8 within the 
interval from 40.30 to 60.00 centiMorgans. SEQ ID NO: 100 maps to chromosome 14 within the 
interval from 95.50 to 103.70 centiMorgans, and to chromosome 6 within the interval from 158.50 
centiMorgans to q-terminus. SEQ ID NO: 104 maps to chromosome 18 within the interval from 32.40 
to 42.70 centiMorgans. SEQ ID NO: 105 maps to chromosome 19 within the interval from 69.90 to 

35 8 1 .20 centiMorgans. 
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The invention also encompasses CC YPR variants. A preferred CCYPR variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the CCYPR amino acid sequence, and which contains at least on6 functional or 
structural characteristic of CCYPR. J * 

5 The invention also encompasses polynucleotides which encode CCYPR. In a particular 

embodiment^ the invention encompasses a polynucleotide* sequence comprising a sequence selected from 
the group consisting of SEQ ID NO:53-108, which encodes CCYPR. The polynucleotide sequences of 
SEQ ID NQ:55-108, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 

10 is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding CCYPR. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding CCYPR. A particular aspect of the invention encompasses a variant of a polynucleotide 

15 sequence comprising a sequence selected from the group consisting of SEQ ID NO:55-108 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:55-108. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of CCYPR. 

20 It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 

code, a multitude of polynucleotide sequences encoding CCYPR, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 

25 accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
occurring CCYPR, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode CCYPR and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring CCYPR undo- appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding CCYPR or 

30 its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding CCYPR and its derivatives without altering the encoded amino acid sequences 
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include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DN A sequences which encode CC YPR and 
CCYPR derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
5 synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding CCYPR or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of - 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
10 NO:55-108 and fragments thereof under various conditions of stringency. (See, e.g., Walri, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-51 1 .) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known, in the art and may be used to practice any of the 

15 embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of 

DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE Biosystems, 
Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 

20 automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and AB1 CATALYST 800 thermal cycler (PE 
Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system 
(PE Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale 
CA), or other systems known in the art. The resulting sequences are analyzed using a variety of 

25 algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in 
Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular 
Biology and Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding CCYPR may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 

30 such as promoters and regulatory elements. For example, one method which may be employed, 

restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
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known genomic locus and surrounding sequences. (See, e.g. t Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PGR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Appiic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). ' . • 
Additionally, one, may use PCR, nested primers, x and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/cxon junctions. For all PCR-based methods, primers.may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
72°C. 

When screening for full-length cDNAs, it is preferable to use hbrarics that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a fun-length cDNA. Genomic libraries may be useful for extension of sequence into 5* 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophorctic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode CCYPR may be cloned in recombinant DNA molecules that direct expression of CCYPR, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express CCYPR. 

The nucleotide sequences of the present invention can be engineered using methods generally 
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known in the art in order to alter CC YPR-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides^ may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
5 mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et ai. (1999) Nat 

10 Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 

improve the biological properties of CCYPR, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures thatidentify those gene variants with the desired 

15 properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 

20 homologous genes in the same gene family, either from the same or different species, thereby 

maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding CCYPR may be synthesized, in whole or in part, 
using chemical methods well known in the art (See, e.g„ Caruthcrs, M.H. et ah (1980) Nucleic Acids 

25 Symp. Sex. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
CCYPR itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g„ 
Creighton, T. (1984) Proteins, Structures and Molecular Properties . WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 

30 using the ABI 43 1 A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of 
CCYPR, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

35 chromatography. (See, e.g., Chiez, R.M. andF.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
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The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active CCYPR, the nucleotide sequences encoding CCYPR or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
5 the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5* and 3' untranslated regions in the vector and in polynucleotide sequences 
encoding CCYPR. Such elements* may vary in their strength and specificity. Specific initiation signals 

may also be used to achieve more efficient translation of sequences encoding CCYPR. Such signals 

- *- 
10 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

sequences encoding CCYPR and its initiation codon and upstream regulatory sequences are inserted 

into the appropriate expression vector, no additional transcriptional or translational control signals may 

be needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, 

exogenous translational control signals including an in-frame ATG initiation codon should be provided 

15 by the vector. Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. 
Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 

20 vectors containing sequences encoding CCYPR and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1 989) Molecular Cloning. A Laboratory 
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

25 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding CCYPR. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 

30 tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g„ Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer/C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard. E.K. et al. (1994) Proc. Natl. 
Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Thcr. 7:1937-1945; Takamatsu, 
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N. (1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224: 838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
McGfaw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 

5 JJ. et al. (1997) Nat. Genet 15:345-355.) Expression vectors derived from retroviruses, 

adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et all (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. 
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 

10 (1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
-The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding CC YPR. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding CCYPR can be achieved using a 

15 multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
(Life Technologies). Ligation of sequences encoding CCYPR into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 

20 deletions in the cloned sequence. (See, e.g„ Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of CCYPR arc needed, e.g. for the production of antibodies, 
vectors which direct high level expression of CCYPR may be used. For example, vectors containing ' 
the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of CCYPR. A number of vectors 

25 containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 

promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra : 
Bitter, supra; and Scorer, supra .) 

30 Plant systems may also be used for expression of CCYPR. Transcription of sequences 

encoding CCYPR may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone 
or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-3 1 1 ). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, supra ; Broglie, supra : and Winter, supra .) These 

35 constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
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transfection. (See, e.g. t The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, 

New York NY, pp. 191-196.) 

* - ! - * 

In mammalian cells, a number of viral-based expression systems may be utilized In cases 

where an adenovirus is used as an expression vector, sequences encoding CCYPR may be ligated into 

5 an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 

r sequence. Insertion in a non-essential El cm- E3 region of the virkl genome may be used to obtain 

infective virus which expresses CCYPR in host cells. £See, e.g. ( Logan, L and T. Shenk (1984) Proc. 

Natl. Acad Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 

(virus (RSV) enhancer, maybe used to increase expression in mammalian host cells. SV40 or EB V- - 

10 based vectors may also be used for high-level protein expressioa 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g. f Harrington, JJ. et al. (1997) Nat. Genet. 15:345-355.) 

15 Pot long term production of recombinant proteins in mammalian systems, stable expression of 

CCYPR in cell lines is preferred. For example, sequences encoding CCYPR can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 

20 being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 

25 but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in ik and apr cells, respectively. (See, e.g., Wigler, M. ct al. (1977) Cell 1 1:223-232; 
Lowy, 1. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selectioa For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 

30 chlorsulfiiron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. ct al. (1980) 
Proc. Natl. Acad Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), 8 
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glucuronidase and its substrate B-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g„ Rhodes, C.A. 
(1995) Methods Mol. Biol. 55:121-131.) 
5 Although the presence/absence of marker gene expression suggests that the gene of interest is 

also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding CCYPR is inserted within a marker gene sequence, transformed cells containing 
sequences encoding CC YPR' can be identified by the absence of marker gene functioa Alternatively, a 
marker gene can be placed in tandem with a sequence encoding CCYPR under the control of a single 
10 * promoter. Expression of thp marker gene in response to induction or selection usually indicates* 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding CCYPR and that express 
CCYPR may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
15 amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of CCYPR using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
20 activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 

antibodies reactive to two non-interfering epitopes on CCYPR is preferred, but a competitive binding 
assay may be employed These and other assays are well known in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, Sect, IV; Coligan, LE. 
et al. (1997) Current Protocols in Immunology . Greene Pub. Associates and Wiley-Intersciencc, New 
25 York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding CCYPR include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
30 Alternatively, the sequences encoding CCYPR, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RN A polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
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(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for case 
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. c 

Host cells transformed with nucleotide sequences encoding CCYPR may be cultured under 
conditions suitable for the expression and recovery of the protein from ceU culture. The protein ■ 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the aft, expression vectors containing 
polynucleotides which encode CCYPR may be designed to contain signal sequences which direct 
secretion of CCYPR through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 

i 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-transIaUonal processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding CCYPR may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric CCYPR protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of CCYPR activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resias, 
respectively. FLAG, c-myc y and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between the CCYPR encoding sequence and the heterologous protein sequence, so that CCYPR 
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 

37 

SUBSTITUTE SHEET (RULE 26) 



WO 01/07471 



PCT/US00/19948 



available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled CCYPR may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). Thfese systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
5 SP6, promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for : 
example, ^S-methionine. * , . ■ \ 

CCYPR of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to CCYPR. At least one and up to a plurality of test compounds may be 
screened for specific binding to CCYPR. Examples of test compounds include antibodies, 

10 oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
CCYPR, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which CCYPR 

15 binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 

compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express CCYPR, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila . or E. 
coli. Cells expressing CCYPR or cell membrane fractions which contain CCYPR are then contacted 

20 with a test compound and binding, stimulation, or inhibition of activity of either CCYPR or the 
compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with CCYPR, cither in 
25 solution or affixed to a solid support, and detecting the binding of CCYPR to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

30 CCYPR of the present invention or fragments thereof may be used to screen for compounds 

that modulate the activity of CCYPR. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
CCYPR activity, wherein CCYPR is combined with at least one test compound, and the activity of 
CCYPR in the presence of a test compound is compared with the activity of CCYPR in the absence 

35 of the test compound. A change in the activity of CCYPR in the presence of the test compound is 
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indicative of a compound that modulates the activity of CCYPR. Alternatively, a test compound is 

combined with an in vitro or cell-free system comprising CCYPR under conditions suitable for 

CCYPR activity, and the assay is performed. In either of these assays, a test compound which 

i modulates the activity of CCYPR may do so indirectly and need not come in direct contact with the 

5 test compound. At least one and up to a plurality of test compounds may be screened^ v 
1 f In another embodiment, polynucleotides encoding CCYPR or their mammalian homologs 
may be "knockqdrout" in an animal model system, using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human;disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 

10 example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 

15 the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stages-specific 
manner (Maith, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 

20 heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding CCYPR may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 

25 into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1 145-1 147). 

Polynucleotides encoding CCYPR can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding CCYPR is in jected into animal ES cells, and the injected 

30 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to ovcrexpress CCYPR, e.g., by secreting CCYPR in its milk, may 
also serve as a convenient source of that protein (Jannc, J. et at. (1998) Biotechnol. Annu. Rev. 4:55- 

35 74). 
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THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of CCYPR and cell cycle and proliferation proteins. In addition, the expression Of 
CCYPR is closely associated with inflammation, trauma, cell proliferation and cancer. Therefore, 
CCYPR appears to play a role in immune, developmental, and cell signaling disorders, and 7 cell 
proliferative disorders including cancer, in the treatment of disorders associated with increased 
CCYPR expression or activity, it is desirable to decrease the expression or activity of CCYPR. In the 
treatment of disorders associated with decreased CCYPR expression or activity, it is desirable to 
increase the expression or activity of CCYPR. 

Therefore, in one embodiment, CCYPR or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CCYPR. Examples of such disorders include, but are not limited to, an immune disorder 
such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) T bronchitis, bursitis, 
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, 
irritable bowel syndrome, mixed connective tissue disorder (MCTD), multiple sclerosis, myasthenia 
gravis, myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, 
pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, 
scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 
lymphoma, leukemia, and myeloma; a developmental disorder such as renal tubular acidosis, anemia, 
Cushing's syndrome, achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, 
gonadal dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and 
mental retardation), Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial 
dysplasia, hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndcnham's chorea 
and cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, 
sensorineural hearing loss, and disorders of immune cell activation; a cell signaling disorder including 
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endocrine disorders such as disorders of the hypothalamus and pituitary resulting from lesions such as 
primary brain tumors, adenomas, infarction associated with pregnancy, hypophysectomy. aneurysms, 
vascular malformations, thrombosis, infections, immunological' disorders, and complications due to 
head trauma; disorders associated with hyperpituitarism including acromegaly, giantism, and 

5 syndrome of inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by bpnign 
adenoma; disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis 
associated with bacterial infection; disorders associated with hyperparathyroidism including Conn 
disease (chronic hypcrcalemia); pancreatic disorders such as Type 1 or Type II diabetes mellitus and 
associated complications; disorders associated with the adrenals such as hyperplasia, carcinoma, or 

10 adenoma of the adrenal cortex, hypertension associated with alkalosis; disorders associated with 

gonadal steroid hormones such as: in women, abnormal prolactin production, infertility, including 

i 

tubal disease, ovulatory defects, and endometriosis, perturbations of the menstrual cycle, polycystic 
ovarian disease, ovarian hyperstimulation syndrome, an endometrial or ovarian tumor, a uterine 
fibroid, autoimmune disorders, an ectopic pregnancy, teratogenesis, hypcrprolactinemia, isolated 

15 gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, hirsutism and virilization, 
breast cancer, and fibrocystic breast disease; and, in post-menopausal women, osteoporosis; and, in 
men, Leydig cell deficiency, male climacteric phase, germinal cell aplasia, hypergonadal disorders 
associated with Leydig cell tumors, androgen resistance associated with absence of androgen 
receptors, syndrome of 5 a -reductase, a disruption of spermatogenesis, abnormal sperm physiology, 

20 cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, 
impotence, carcinoma of the male breast, and gynecomastia; and a cell proliferative disorder such as 
actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 
disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 

25 myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing CCYPR or a fragment or derivative 

30 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of CCYPR including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
CCYPR in conjunction with a suitable pharmaceutical carrier may he administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of CCYPR including, but not 

35 limited to, those provided above. 
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In still another embodiment, an agonist which modulates the activity of CC YPR may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of CC YPR including, but not limited to, those listed above. 

In a further embodiment, an antagonist of CCYPR may be administered to a subject to treat or 
5 prevent a disorder associated with increased expression or activity of CCYPR. Examples of such 
disorders include, but are not limited to, those immune, developmental, and cell signaling disorders, and 
cell proliferative disorders including cancer, described above. In one aspect, an antibody which 
specifically binds CCYPR may be used directly as an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissues which express CCYPR. 

10 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding CCYPR may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of CCYPR including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 

15 therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

20 An antagonist of CCYPR may be produced using methods which are generally known in the 

art. In particular, purified CCYPR may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind CCYPR. Antibodies to CCYPR may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 

25 fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with CCYPR or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 

30 increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Gucrin) and Corynebacterium parvum arc especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
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CCYPR have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that thesq oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of 
CCYPR amino acids may be fused with those of another protein, such as KLH, ^and antibodies to the 
5 chimeric molecule may be produced. „ ' 

Monoclonal antibodies to CCYPR may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, ag., Kohier, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

10. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
r Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, c.g. t Morrison, S.L. et al. (1984) Proc. 

15 Natl. Acad, Sci. USA 81 :6851-6855; Neuberger, M:S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce CCYPR-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g„ Burton, 

20 DR. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, c.g., Qrlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

25 Antibody fragments which contain specific binding sites for CCYPR may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the 
F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 

30 (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between CCYPR and its 
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specific antibody. A two-site, monoclonal^based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering CCYPR epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra) . 

Various methods «uch as Scatchard analysis in conjunction with radioimmunoassay techniques 
5 may be used to assess the affinity of antibodies for GCYPR. Affinity is expressed as an association 
constant, K*, which is defined as the molar concentration of CCYPR-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The K., determined 
7 for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 
CC YPR epitopes, represents the average affinity, or avidity, of the antibodies for CCYPR. The K* 
10 determined for a preparation ;Of monoclonal antibodies, which are monospecific for a particular CCYPR 
epitope, represents a true measure of affinity. High-affinity antibody preparations with ranging from 
; about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the CCYPR-antibody complex 
. must withstand rigorous manipulations. Low-affinity antibody preparations with ranging from 
about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar procedures which 
15 ultimately require dissociation of CCYPR, preferably in active form, from the antibody (Catty, D. 
(1988) Antibodies, Volume I: A Practical Approach , IRL Press, Washington DC; Liddell, J.E. and A 
Cryer (1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
20 polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of CCYPR-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra, and 
Coligan et al.. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding CCYPR, or any fragment 

or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding CCYPR. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 

30 designed from various locations along the coding or control regions of sequences encoding CCYPR. 
(Sec, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
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complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. etal. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.)i Antisense sequences can also be^ introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno- associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra; Uckert; W. and W, Wallher (1994) Pharmacol. Then 63(3):323-347.) Other 
gene delivery mechanisms include liposome-depved systems, artificial viral envelopes, and other 
systems known in the art (See, e.g., Rossi, J;J. (1995) Br, Med. Bull. 51(l):217-225;'Boado, RJ. et 
al. (1998) J! Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) v . ' 

. In another embodiment of the invention, polynucleotides encoding CCYPR may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genedc deficiency 
(e.g. f in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g„ 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad Sci. USA 93:11395-11399), 
hepatitis B or C virus (HBV, HC V); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzH . In the 
case where a genetic deficiency in CCYPR expression or regulation causes disease, the expression of 
CCYPR from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
CCYPR are treated by constructing mammalian expression vectors encoding CCYPR and introducing 
these vectors by mechanical means into CCYPR-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) 
the use of DNA transposons (Morgan, R.A and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191- 
217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. Opia BiotechnoL 
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9:445^50). 

Expression vectors that may be effective for tfic expression of CC YPR include, but are not 
limited to, the PCDNA 3.1, EP1TAG, PRCCMV2, PREP, PV AX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
5 PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). CCYPR may be expressed 
using (i) a constitutively active promoter, (e.g„ from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), S V40 virus, thymidine kinase (TK), or (i-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promote- (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA . 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1760-1769; Rossi, F.M.V. and H.M. Blau (1998) 

10 Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding CCYPR from a normal individual. 

15 Commercially available liposome transformation kits (e.g., the PERFECT LIPID 

TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and AJ. Eb (1973) Virology 52:456-467), or by clectroporation (Neumann, E. et al. 

20 (1 982) EMBO J. 1 :841 -845). The introduction of DN A to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to CCYPR expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding CCYPR under the control of an independent promoter or the retrovirus long 

25 terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus ci'j-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 

30 appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armcntano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A and 
AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
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retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 

method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 

- t 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-cells), and the 

return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 

5 therapy and have been well documented (Ranga, U. et ai; (1997) J. Virol. 71 :7020-7029* Bauer, G. et 

al. (1997) Blood 89:2259-2267; Bonyhadi, M.U (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 

(1998) Proc. Natl. Acad. Sci. USA95:1201-i206; Su, L. (1997) Blood 89:2283.-2290). 

In the alternative, an adenovirus -based gene therapy delivery system is used to deliver 

polynucleotides encoding CCYPR to cells which have one or more genetic abnormalities with respect to 

10 the expression of CCYPR. The ponstruction and packaging of adenovirus-based vectors are well 

known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 

15 hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. 
Rev. Nutr. 19:5 1 1-544; and Verma, l.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding CCYPR to target ceils which have one or more genetic abnormalities with 

20 respect to the expression of CCYPR. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing CCYPR to cells of the central nervous system, for which HS V has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. ct al. (1999) Exp. Eye 

25 Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,41 3 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 

30 patent arc the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev. 
Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus following the transfection of multiple plasmids 
containing different segments of the large herpesvirus genomes, the growth and propagation of 
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herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary 
skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding CCYPR to target cells. The biology of the prototypic alphavirus, 
5 Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 
the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus 
RNA replication, a subgenoniic RNA is generated that normally encodes the viral capsid proteins. This, 
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the 
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g. f protease 

10 and polymerase). Similarly, inserting the coding sequence for CCYPR into the alphavirus genome in 
place of the capsid-coding region results in the production of a large number of CCYPR-coding RNAs 
and the synthesis of high levels of CCYPR in vector transduced cells. While alphavirus infection is 
typically associated with cell lysis within a few days, the ability to establish a persistent infection in 
hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic 

15 replication of alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S. A. 
et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of 
CCYPR into a variety of cell types. The specific transduction of a subset of cells in a population may 
require the sorting of cells prior to transduction. The methods of manipulating infectious cDN A clones 
of alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus 

20 infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 

25 transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches . Futura Publishing Mt. KiscoNY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

30 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding CCYPR. 
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Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for { 

5 secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. . 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 

10 chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 

Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DN A sequences 
encoding CCYPR. Such DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or 

15 tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3* ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 

20 extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding CCYPR. 

25 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but arc not limited to, oligonucleotides, antisensc oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 

30 promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
CCYPR expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding CCYPR may be therapeutically useful, and in the treament of disorders 
associated with decreased CCYPR expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding CCYPR may be therapeutically useful. 

35 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
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altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
5 based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding CCYPR is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding CCYPR are assayed 

10 by any method commonly known in the art Typically, the expression of a specific nucleotide is 

detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding CCYPR. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 

15 exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomvces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28.E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 

20 Res. Commun. 268:8-1 3). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

25 Many methods for introducing vectors into ceils or tissues are available and equally suitable for 

use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 

30 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
35 composition which generally comprises an active ingredient formulated with a pharmaceutically 

50 



SUBSTITUTE SHEET (RULE 26) 



WO 01/07471 PCT/US00/19948 

acceptable excipient Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest edition 
of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such pharmaceutical 
compositions may consist of CCYPR, antibodies to CGYPR, and raimetics, agonists, antagonists, or 

5 inhibitors of CCYPR. 

The pharmaceutical compositions utilized in this invention may be administered by any number 
of routes including, but not limited to, dral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, 
topical, sublingual, or rectal means. ? - ^ ' , 

10 Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 

powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 
delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar region of 

15 the lung have enabled the practical delivery of drugs such as insulin to blood circulation (sec, e.g., 

Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration 
without needle injection, and obviates the need for potentially toxic penetration enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein the 
active ingredients are contained in an effective amount to achieve the intended purpose. The 

20 determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
delivery of macromolecules comprising CCYPR or fragments thereof. For example, liposome 
preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, CCYPR or a fragment thereof may be joined to a short 

25 cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 

found to transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, 
S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 

30 or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example CCYPR 
or fragments thereof, antibodies of CCYPR, and agonists, antagonists or inhibitors of CCYPR, which 
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ameliorates the symptoms or conditioa Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 30 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
5 therapeutic index, which can be expressed as the \£>yJED^ ratio. Pharmaceutical compositions which 
exhibit large therapeutic indices are preferred, The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 with little 
or no toxicity. The dosage varies within this range depending upon the dosage form employed, the 

10 sensitivity of the patient, and the route of administratioa 

The exact dosage will be determined by the practitioner, in light of factors related to the subject 
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 

15 and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 ^g to 100,000 /ig, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 

20 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

25 In another embodiment, antibodies which specifically bind CC YPR may be used for the 

diagnosis of disorders characterized by expression of CCYPR, or in assays to monitor patients being 
treated with CC YPR or agonists, antagonists, or inhibitors of CCYPR. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for CCYPR include methods which utilize the antibody and a label to detect CCYPR 

30 in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by covalent or non-covalent attachment of a reporter molecule. A 
wide variety of reporter molecules, several of which are described above, are known in the art and may 
be used. 

A variety of protocols for measuring CCYPR, including ELIS As, Rl As, and FACS, are known 
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in the art and provide a basis for diagnosing altered or abnormal levels of CC YPR expression. Normal 

or standard values for CCYPR expression are established by combining body fluids or cell extracts 

taken from normal mammalian subjects, for example, human subjects, with antibody to CCYPR under 

... " . ' ' * s 

conditions suitable for complex formation. The amount of standard complex formation may be ' 

( * ■: 

5 quantitated by various methods, such as photometric means. Quantities of CCYPR expressed in 
subject, control, ana disease samples from biopsied tissues are compared'with the standard values. 
' Deviation between standard and subject values^establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding CCYPR may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 

10 complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of CCYPR may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
CCYPR, and to monitor regulation of CCYPR levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

15 sequences, including genomic sequences, encoding CCYPR or closely related molecules may be used to 
identify nucleic acid sequences which encode CCYPR. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding CCYPR, allelic variants, or related 

20 sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the CCYPR encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:55-108 or from 
genomic sequences including promoters, enhancers, and introns of the CCYPR gene. 

25 Means for producing specific hybridization probes for DNAs encoding CCYPR include the 

cloning of polynucleotide sequences encoding CCYPR or CCYPR derivatives into vectors lor the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 

30 of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as 
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding CCYPR may be used for the diagnosis of disorders 
associated with expression of CCYPR. Examples of such disorders include, but arc not limited to, an 
immune disorder such as inflammation, actinic keratosis, acquired immunodeficiency syndrome 
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(AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, 
autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy 
(APECED), bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic 
dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with 
lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal 
hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, mixed connective tissue 
disorder (MCTD), multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, 
Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, 
systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, 
trauma, and hematopoietic cancer including lymphoma, leukemia, and myeloma; a developmental 
disorder such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' 
tumor, aniridia; genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary 
neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, 
hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, spina bifida, 
anencephaly, craniorachischisis, congenital glaucoma, cataract, sensorineural hearing loss, and 
disorders of immune cell activation; a cell signaling disorder including endocrine disorders such as 
disorders of the hypothalamus and pituitary resulting from lesions such as primary brain tumors, 
adenomas, infarction associated with pregnancy, hypophysectomy, aneurysms, vascular 
malformations, thrombosis, infections, immunological disorders, and complications due to head 
trauma; disorders associated with hyperpituitarism including acromegaly, giantism, and syndrome of 
inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by benign adenoma; 
disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis associated 
with bacterial infection; disorders associated with hyperparathyroidism including Conn disease 
(chronic hypercalemia); pancreatic disorders such as Type I or Type II diabetes mellitus and 
associated complications; disorders associated with the adrenals such as hyperplasia, carcinoma, or 
adenoma of the adrenal cortex, hypertension associated with alkalosis; disorders associated with 
gonadal steroid hormones such as: in women, abnormal prolactin production, infertility, including 
tubal disease, ovulatory defects, and endometriosis, perturbations of the menstrual cycle, polycystic 
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ovarian disease, ovarian hyperstimulation syndrome, an endometrial or ovarian tumor, a uterine 
fibroid, autoimmune disorders, an ectopic pregnancy, teratogenesis, hyperprolactinemia, isolated 
gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, hirsutism and virilization, 
breast cancer, and fibrocystic breast disease; and, in post-menopausal women, osteoporosis; and, in 
5 men, Leydig cell deficiency, male climacteric phase, germinal cell aplasia, hypergonadal disorders 
associated with Leydig cell tumors, androgen resistance associated with absence of androgen 
receptors, syndrome*of.5 a-reductase, a disruption of spermatogenesis, abnormal sperm physiology, 
cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, 
impotence, carcinoma of the male breast, and gynecomastia; and a cell proliferative disorder such as 

10 actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 
disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 

15 lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 

thymus, thyroid, and uterus. The polynucleotide sequences encoding CCYPR may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, 
pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
detect altered CCYPR expression. Such qualitative or quantitative methods are well known in the art. 

20 In a particular aspect, the nucleotide sequences encoding CCYPR may be useful in assays that 

detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding CCYPR may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 

25 value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding CCYPR in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
treatment of an individual patient. 

30 In order to provide a basis for the diagnosis of a disorder associated with expression of 

CCYPR, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding CCYPR, under conditioas suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from normal 

35 subjects with values from an experiment in which a known amount of a substantially purified 
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polynucleotide is used. Standard values obtained in this manner may be compared with values obtained 
from samples from patients who are symptomatic for a disorder. Deviation from standard values is 
used to establish the presence of a disorder. ' 

Once the presence of a disorder is established and a treatment protocol is initiated, 

5 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. r : . 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

10 ovcrexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

15 Additional diagnostic uses for oligonucleotides designed from the sequences encoding CCYPR 

may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding 
CCYPR, or a fragment of a polynucleotide complementary to the polynucleotide encoding CCYPR, and 
will be employed under optimized conditions for identification of a specific gene or condition. 

20 Oligomers may also be employed under less stringent conditions for detection or quantification of 
closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding CCYPR may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 

25 in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding CCYPR are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 

30 and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are 
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as 
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
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overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and^automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high 
5 throughput MASS ARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of CCYPR include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating, results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
1 0 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometry or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 

15 can be used in transcript imaging techniques which monitor the relative expression levels of large 

numbers of genes simultaneously as described in Seilhamer, JJ. et al., "Comparative Gene Transcript 
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 

20 progression/regression of disease as a function of gene expression, and to develop and monitor the 

activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/ha- pharmacogenomic profile. 

25 * n another embodiment, antibodies specific for CCYPR, or CCYPR or fragments thereof may 

be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 

30 gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
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transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity 
5 Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 

* or other biological- samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
may also be used in conjunction with in vitro model systems and preclinical evaluation of 

10 pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) 
Toxicol. Lett. 112-113 :467-47 1 , expressly incorporated by reference herein). If a test compound has a 

15 Signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome- wide measurement of expression 
provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 

20 rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 

25 February 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 

30 biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

35 Another particular embodiment relates to the use of the polypeptide sequences of the present 
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invention to analyze the proteome of a tissue or cell type. The term pfoteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. :In one embodiment, the separation is 
achieved using two-dimfensional gel electrophoresis, in which proteins from a sample are separated by 
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, sunra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 
positioned proteiii spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment. The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for CCYPR to quantify the 
levels of GCYPR expression. In one embodiment, the antibodies are used as dements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
levels of protein bound to each array element (Lueking, A. et al. ( 1 999) Anal. Biochem. 270: 1 03- 1 1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 
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sample containing proteins with the test compound. Proteins that are expressed iii the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
" is compared to the amount of thp corresponding protein in an untreated biological sample. A difference 
in the amount of protein between the two samples is indicative of a toxic response to the test compound 
5 in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the^polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 

10 by the antibodies is quantified The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared used and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad Sci. 

15 USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Hello:, R.A. et al. (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarrays: A Practical Approach . M. Schena, ed (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

20 In another embodiment of the invention, nucleic acid sequences encoding CCYPR may be used 

to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 

25 mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 

chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 

30 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g., 
Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 

35 data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
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data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding CCYPR on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. . 

! 5 * In situ hybridization of Chromosomal preparations and physical mapping techniques, such as 

linkage analysis using established chromosomal markers,.may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 

10 techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 

by genetic linkage to a particular genomic region, e.g„ ataxia-telangiectasia to ^1 lq22-23, any sequences 

mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g. f 

* * 
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 

also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 

15 among normal, carrier, or affected individuals. 

In another embodiment of the invention, CCYPR, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 

20 between CCYPR and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with CCYPR, or fragments thereof, 

25 and washed. Bound CCYPR is then detected by methods well known in the art. Purified CCYPR can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
30 antibodies capable of binding CCYPR specifically compete with a test compound for binding CCYPR. 
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with CCYPR. 

In additional embodiments, the nucleotide sequences which encode CCYPR may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
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properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and' publications mentioned above and below, in 
particular U.S. Sen No. 60/145,075, U.S. Ser. No. 60/153,129; and U.S. Ser. No. "60/1 64,647, are 
hereby expressly incorporated by reference. 

■■■■ . y 

EXAMPLES ' 
I. Construction of cDNA Libraries ; 

RN A was purchased from Clontech or isolated from tissues described in Table 4; Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T>coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDN A was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SI 000, SEPH AROSE CL2B, or SEPH AROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid 
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(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
plasmids were transformed into competent E. coli cells including XL1 -Blue, XLl-BlueMRF, or SOLR 
from Stratagene or DH5a, DH10B, or Electrcdtf AX DH10B from Life Technologies. 
II. Isolation of cDNA Clones , 
5 ^ Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Strat&gene) or by cell lysis. Plasmids were purified using at least 
one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC 
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and'QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. A.L. PREP 96 plasmid 

10 purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
; distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct^link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 

15 well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystcms Oy, Helsinki, Finland). 
HI. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 

20 Sequencing reactions were processed using standard methods or high-throughput instrumentation 

such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 
Research) in conjunction with the HYDRA microdispenser (Robbias Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system cDN A sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 

25 PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 
separation of cDNA sequencing reactioas and detection of labeled polynucleotides were carried out 
using the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling 
software; or other sequence analysis systems known in the art. Reading frames within the cDNA 

30 sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 7.7). Some 
of the cDNA sequences were selected for extension using the techniques disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
an. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
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references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
5 the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco C A) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 

10 also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleotide nearest neighbor analysis. The seqSences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 

15 eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 

20 and these full length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., 
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

25 The programs described above for the assembly and analysis of full length polynucleotide and 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:55-108. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

30 Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
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molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. ; In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: ? " * * 

BLAST Score x Percent Identity 
< 5 f x minimum {length(Seq. 1), length(Seq. 2)} * ' 

The produa score takes4nto account both the degree of similarity between two sequences and the length 
Of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the IBLAST score is multipUed by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score i§ calculated by 
assigning a score of +5 for every base that matchies in a high-scoring segment pair (HSP), and -4 tor 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 1 00% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding CCYPR occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
3. 

V. Chromosomal Mapping of CCYPR Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:55-108 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:55-108 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
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from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (W1GR), and G6n6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 
5 The genetic map locations of SEQ ID NO:61 , SEQ ID NO:73, SEQ ID NO:74, SEQ ID 

NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:81, SEQ ID NO:90, SEQ ID 
NO:91, SEQ ID NO:98, SEQ ID NO: 100, SEQ ID NO: 104, and SEQ ID NO: 105 are described in 
The Invention as ranges, or intervals, of human chromosoines. More than one map location is 
reported for SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:90, and SEQ ID NO: 100, indicating that 

10 previously mapped sequences having similarity, but not complete identity, to SEQ ID NO:76, SEQ 
ID NO:77, SEQ ID NO:90, and SEQ ID NO: 100 were assembled into their respective clusters. The 
map position of an interval, in centiMorgans, is measured relative to the terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 

15 (Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI ,, GeneMap , 99 tt World Wide Web 
site (http://www.ncbi.nlm.nih.gov/gcnemap/), can be employed to determine if previously identified 

20 disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of CCYPR Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:55-108 were produced by extension of an 
appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 

25 primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 °C to about 72°C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

30 Selected human cDNA libraries were used to extend the sequence. If more than one extension 

was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PGR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 \ (NH 4 ) 2 S0 4 , 

35 and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONG ASE enzyme 
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(Life Technologies), and Pfii DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1 : 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2/3, and 4 repeated.20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 °C, 3 min; Step 2: 
5 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2/3, and 4 repeated 20 times; 
Step 6: 68°C, 5 min; Step 7: storage at 4°C. - . * \ * 

The concentration of DNA in each well was determined by dispensing 100 ^il PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 

' and 0.5 jll! of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 

10 Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA A 5 ij\ to 10 /-d aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 
The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

15 digested with CvUI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amcrsham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligasc (New England Biolabs, Beverly MA) into pUC 1 8 vector (Amersham 

20 Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and trans fected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 384-well plates in LB/2x 
carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
25 Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 

94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60 9 C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
30 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (PE Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:55-108 are used to obtain 5* 
regulatory sequences using the procedure above, along with oligonucleotides designed for such 
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extension, and an appropriate genomic library. 

VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:55-108 are employed^to screen cDNAs, 
genomic DN As, or mRN As. Although the labeling of oligonucleotides, consisting of about 20 base : 
5 pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 //Ci of [y- 32 P] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 

10 exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba f, or 
Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

15 membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

20 VIIL Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra) , mechanical 
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 

25 substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 

30 Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
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elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
5 desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 

complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. : - - 

Tissue or Cell Sample Preparation 
10 Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A)* RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mer) t IX first 
strand buffer, 0.03 units/id RNase inhibitor, 500 dATP, 500 jiM dGTP, 500 pM dTTP, 40 fiM 

{ i 

dCTP, 40 ixM dCTP-Cy3 (BDS) or dCTP-Cy5 {Amersham Pharmacia Biotech). The reverse 

15 transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 

GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 

20 using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSC/0.2% SDS. 

25 Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 

30 ng. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
35 Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
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coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 ^1 of the array element DNA, at an average 
5 concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crossl inked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays arc washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
10 buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 fjQ of sample mixture consisting of 0.2 ng each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

15 mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1.8 cm 2 covcrslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 pi of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60 °C. The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC, 

20 0.1 % SDS), three times for 10 minutes each at 45 °C in a second wash buffer (0. IX SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 

25 focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

30 Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 

35 although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
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The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDN A control species added to the sample mixture at a known concentration. A specific location on 

the array contains a complementary DNA sequence, allowing the intensity of the signal at that ' 

i - f , 

location to be correlated with a weight ratio of hybridizing species of 1 : 1 00,000. When two samples 

" \ - ' -t * 

5 from different sources (e.g., representing test and control cells), each labeled with a different 

fluorophbre, are hybridized to a single array for the purpose of identifying genes that are differentially 

expressed, the calibrations done by labeling samples of the calibrating cDNA with the two 

fluorophores and adding identical amounts of each to the hybridization mixture. ' 

The output of the photomultiplier tube is digitized using a 1 2-bit RTI-835H analog-to-digital 

10, (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from*blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 

15 emission spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

20 IX. Complementary Polynucleotides 

Sequences complementary to the CCYPR-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring CC YPR. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 

25 4.06 software (National Biosciences) and the coding sequence of CCYPR. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5 * sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the CCYPR-encoding transcript. 
X, Expression of CCYPR 

30 Expression and purification of CCYPR is achieved using bacterial or vims-based expression 

systems. For expression of CCYPR in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDN A 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tad) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 

35 element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21 (DE3). 
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Antibiotic resistant bacteria express CCYPR upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of CCYPR in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDN A encoding CCYPR by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral inf activity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Snodontera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandigl V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

Inmost expression systems, CCYPR is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a pbptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicunu enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from CCYPR. at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, 
ch. 10 and 1 6). Purified CCYPR obtained by these methods can be used directly in the assays shown in 
Examples XI and XV. 

XL Demonstration of CCYPR Activity 

An assay for CCYPR activity measures cell proliferation as the amount of newly initiated DNA 
synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucleotides encoding CCYPR is 
transfected into quiescent 3T3 cultured cells using methods well known in the art. The transiently 
transfected cells are then incubated in the presence of [ 3 H]thymidine, a radioactive DNA precursor. 
Where applicable, varying amounts of CCYPR ligand are added to the transfected cells. Incorporation 
of [ 3 H]thymidine into acid-precipitable DNA is measured over an appropriate time interval, and the 
amount incorporated is directly proportional to the amount of newly synthesized DNA and CCYPR 
activity. 

XII. Functional Assays 

CCYPR function is assessed by expressing the sequences encoding CCYPR at physiologically 
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elevated levels in mammalian cell culture systems. cDNA is subclonedinto a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
, contain the cytomegalovirus promoter. 5-10 ^g of recombinant vector are transiently transfected into a 
5 human cell line, for example, an endothelial or hematopoietic cell line, usang either liposome 

formulations ot electroporation. 1-2 of an additional plasmid containing sequences encoding a 
marker protein are cb-transfected. Expression of a marker protein provides a ; means to distinguish 
transfected cells from nontransfected cells and is a reliable predicts of cDNA expression from the " 
recombinant vector.^ Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 

10 Clontech), CD64, or a CD64 T GFP iusion protein. Flow, cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transacted cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in 

15 cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 

20 (1994) Flow Cytometry . Oxford, New York NY. 

The influence of CCYPR on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding CCYPR and either CD64 or CD64-GFP. CD64 and 
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 

25 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding CCYPR and other genes of interest can be analyzed by northern analysis or 
microarray techniques. 

XIII. Production of CCYPR Specific Antibodies 

30 CCYPR substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Hanington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the CCYPR amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
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synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g. , Ausubel, 1 995, supra, ch. 1 1 .) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
5 peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St 
Louis MO) by reaction wilh N-nialeiinidobenzoyl-N-hydroxysuccinimjde ester (MBS) to increase 
immunogenic^. (See, e.g., Ausubel, 1995, suEra ) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-CCYPR 
' activity by. for example, binding the peptide or CCYPR to a substrate, blocking with 1% BSA, reacting 
10 , with rabbit antiserav washing, and reacting with L radio-iodinated goat anti-rabbit IgG. 
XIV. Purification of Naturally Occurring CCYPR Using Specific Antibodies 

Naturally occurring or recombinant CCYPR is substantially purified by immunoaffinity 
chromatography using antibodies specific for CCYPR. An immunoaffinity column is constructed by 
covalently coupling anti-CCYPR antibody to an activated chromatographic resin, such as 
15 CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing CCYPR are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of CCYPR (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditioas that disrupt 
20 antibody/CCYPR binding (e.g.. a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and CCYPR is collected. 
XV. Identification of Molecules Which Interact with CCYPR 

CCYPR, or biologically active fragments thereof, are labeled with ,25 I Bolton-Hunter reagent. 
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
25 previously arrayed in the wells of a multi-well plate are incubated with the labeled CCYPR, washed, 
and any wells with labeled CCYPR complex are assayed. Data obtained using different concentrations 
of CCYPR arc used to calculate values for the number, affinity, and association of CCYPR with the 
candidate molecules. 

Alternatively, molecules interacting with CCYPR are analyzed using the yeast two-hybrid 
30 system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontcch). 

CCYPR may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
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No. 6,057,101). 



. -Various modifications and, variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. . 
Although the invention has been described in.connection with certain embodiments, it should be ; 
understood that the invention as 'claimed should not be unduly limned to such specific embodiments 
indeed, various modifications of the described modes for carrying out the invention which areWious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 

consisting of: i 

a) an amino acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID 
NO:2,.SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID Np:14. SEQ ID n6:15, SEQ ID NO:17, 
SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:2S, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29; SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, 
SEQ ID NO:33, SEQ ID NQ:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, 
SEQ ID NO:39, SEQ ID NO:4l, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, 
SEQ ID NO:53, and SEQ ID NO:54, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, SEQ ID NO:l 1, 
SEQ ID NO:12, SEQ ID NO:13. SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:18, 
SEQ ID NO:2G, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, 
SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, 
SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, 
SEQ ID NO:47, SEQ ID NO:48. SEQ ID NO:50, SEQ ID NO:51. SEQ ID NO:52, SEQ ID NO:53. 
and SEQ ID NO:54, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID 
NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:23, SEQ ID NO:24, SEQ ID NO:25 T SEQ ID NO:26. SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 
NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID 
NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, and SEQ ID NO:54, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO:l , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO: 10, SEQ ID NO:ll, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
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NO:15, SEQ ID NO: 17. SEQ ID NO:18, SEQ ID NO:20. SEQ ID NO:22, SEQ ID NO:23. SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29. SEQ ID NO:30, SEQ ID, 
NO:31, SEQ ID'NO:32, SEQ ID NOf33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NQ:36, SEQ ID 
NO:37, SEQ ID NO:38, SEQ ID NCfc39, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID 
NO:44. SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:50, SEQ ID 
NO:51,SEQIDNO:52/SE.QIDNO:53,andSEQIDNd:54. ' ' 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO: 3, SEQ ID NO:4, SEQ IDNOS, SEQ ID Np:6, SEQ ID NO:7, SEQ ID 
NO:10, SEQ ID NO:l 1, SEQ ID NO:i2, SEQ ID NO: 13,- SEQ ID NO:R SEQ ID NO:15, SEQ.b 
NO:17, SEQIDNO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:32. SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID 
NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48. SEQ ID NO:50, SEQ ID NO:5 1, SEQ ID 
NO:52, SEQ ID NO:53, and SEQ ID NO:54. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59. SEQ ID NO.60, SEQ ID 
NO:61, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID 
NO:69, SEQ ID NO:71. SEQ ID NO:72, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID - 
NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID 
NO:91. SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96. SEQ ID NO:97, SEQ ID 
NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:104, SEQ 
ID NO:105, SEQ ID NO:106, SEQ ID NO:107, and SEQ ID NO:108. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 
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8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1 , the method comprising: 

a) cul turing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide, of claim 
l t and - x 

b) recovering the polypeptide so expressed. 

10: An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ TD NO:55, SEQ ID 
NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID 
NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID 
NO:71, SEQ ID NO:72, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID 
NO:79, SEQ ID NO:80, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 
NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID 
NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID 
NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102 t SEQ ID NO:104, SEQ ID NO:105, 
SEQ ID NO:106, SEQ ID NO:107, and SEQ ID NO: 108, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:55, SEQ ID NO:56, SEQ 
ID NO:57, SEQ ID NO:58> SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:64, SEQ 
ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69 f SEQ ID NO:71, SEQ 
ID NO:72, SEQ ID NO:74, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ 
ID NO:80, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ 
ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ 
ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ 
ID NO.100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, 
SEQ ID NO:107, and SEQ ID NO:108, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 
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12. An isolated polynucleotide comprising at least 60 contiguous nucleotides 
polynucleotide of claim 11, , 

1 3. * A method for detecting a target polynucleotide in a sample, said target polynucleotide 
5 having a sequence pf a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
coinprising a sequence complementary to said target polynucleotide in the sample; and which probe 
specifically hybridizesto said tairget polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 
10 b > detecting the presence or absence of said hybridization complex, and, optionally, if 

present, the amount thereof. ; 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15 15 - A method for detecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
20 thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutical^ acceptable excipient. 

25 17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ 
ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ 

30 ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ 
ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ 
ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ 
ID NO:48, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, and SEQ ID NO:54. 

35 1 8. A method for treating a disease or condition associated with decreased expression of 
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functional CCYPR, comprising administering to a patient in need of such treatment the composition 
of claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of . 
claim 1/ the method comprising: t 

a) exposing a sample comprising a polypeptide of. claim 1 to a compound, and 
. b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutical^ acceptable excipient 

21 . A method for treating a disease or condition associated with decreased expression of 
functional CCYPR, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 
20 

23. A composition comprising an antagonist compound identified by a method of claim 22 
and a pharmaceutical^ acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
25 CCYPR, comprising administering to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1 , said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
30 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
35 claim 1 , said method comprising: 
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a > combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and f, - ( -' - 4 r . 

c) comparing the activity of the polypeptide of claim 1 in die. presence of the test compound 
-with the activity of the polypeptide of claim 1 in the absence of the test. compound, wherein a change 
in the activity of the polypeptide of claim I in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

28. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test, 
compound. 
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SEQUENCE LISTING 



110> INCYTE GENOMICS, INC. ■* ■ 

HILLMAN, Jennifer L. f 
LAL , Preeti * 

TANG, Y. Tom ' .- * 

YUE, .Henry 

AU- YOUNG , Janice 

BANDMAN , Olga 

AZIMZAI, Yalda r 

YANG , Junming s 

LU, Dyung Aina M. 

BAUGHN, Marian R. 

PATTERSON , Chandra , 

SHAH, £>urvi 

<120> CELL CYCLE AND PROLIFERATION PROTEINS 
<130> PF-0722 PCT ' > 

<140> To Be Assigned 
<141> Herewith 

<150> 60/145,075; 60/153,129; 60/164,647 

<151> 1999-07-21; 1999-09-08; 1999-11-10 

<160> 108 

<170> PERL Program 



<210> 1 
<211> 145 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 116462CD1 

<400> 1 . " 

Met Asn Gly Arg Val Asp Tyr Leu Val Thr Glu Glu Glu He Asn 
1 5 io ■ 15 

Leu Thr Arg Gly Pro Ser Gly Leu Gly Phe Asn He Val Gly Gly 
20 25 30 

Thr Asp Gin Gin Tyr Val Ser Asn Asp Ser Gly He Tyr Val Ser 
• 35 40 45 

Arg He Lys Glu Asn Gly Ala Ala Ala Leu Asp Gly Arg Leu Gin 
50 55 6° 

Glu Gly Asp Lys He Leu Ser Val Asn Gly Gin Asp Leu Lys Asn 
65 70 75 

Leu Leu His Gin Asp Ala Val Asp Leu Phe Arg Asn' Ala Gly Tyr 
80 85 90 

Ala Val Ser Leu Arg Val Gin His Arg Leu Gin Val Gin Asn Gly 
95 100 105 

Pro He Gly His Arg Gly Glu Gly Asp Pro Ser Gly He Pro He 
110 H5 120 

Phe Met Val Leu. Val Pro Val Phe Ala Leu Thr Met Val Ala Ala 
125 130 135 

Trp Ala Phe Met Arg Tyr Arg Gin Gin Leu 
140 145 

<210> 2 
<211> 340 
<212> PRT 
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<213> Homo sapiens 

<220> ■ f . • 1 

<221> miscjfeature ■ ■ t 

<223> Incyte ID. No: 1210462CD1 *• r . ' r 

<400> 2 ' " ' - ' . * f - ' ? 

Met Leu Thr Gin Leu Lys Ala Lys Ser Glu Gly Lys Leu Ala L^s 
? 5 v 10 15 

Gin He Cys Lys Val val. Leu Asp His Phe Glui Lys Gin Tyr Ser 

'20 f -' 25 3q - 

Lys Glu Leu Gly Asp Ala Trp Asn Thr Val Argtalu He Leu Thr ' ' 

4 35 ' 4 ° 45 

Ser Pro Ser. Cys' Trp Gin Tyr Ala Val, Leu Leu Asn Arg Prie Asn. 

Tyr Pro Phe Glu Leu Glu Lys Asp Leu His Leu Lys Gly Tyr His 

65 '70 . 75 

Thr Leu Ser Gin Gly Ser Leu Pro Asn Tyr ' Pro Lys Ser Val Lys 

„ " 80 V 85 v 90 

Cys Tyr Leu Ser Arg- Thr Pro Gly Arg He Pro ker Glu Arg His > 

_ , 95 100 10 5 \ 

Gin He Gly Asn Leu Lys Lys Tyr Tyr Leu Leu Asn Ala Ala Ser * 

110 115 120 

Leu Leu Pro Val Leu Ala Leu Glu Leu Arg Asp Gly Glu Lys Val 

Leu Asp Leu Cys Ala Ala Pro Gly Gly Lys Ser He Ala Leu HI 
140 145 i5o 

Gin Cys Ala Cys Pro Gly Tyr Leu His Cys Asn Glu Tyr Asp Ser 
155 160 16 5 

Leu Arg Leu Arg Trp Leu Arg Gin Thr Leu Glu Ser Phe He Pro 

170 175 180 

Gin Pro Leu He Asn Val He Lys Val Ser Glu Leu Asp Gly Arg 

18.5 190 195 

Lys Met Gly Asp Ala Gin Pro Glu Met Phe Asp Lys Val Leu Val 

200 205 210 

Asp Ala Pro Cys Ser Asn Asp Arg Ser Trp Leu Phe Ser Ser Asp 

215 220 225 

Ser Gin Lys Ala Ser Cys Arg He Ser Gin Arg Arg Asn Leu Pro 

230 235 240 

Leu Leu Gin He Glu Leu Leu Arg Ser Ala He Lys Ala Leu Arg 

245 250 255 

Pro Gly Gly lie Leu Val Tyr Ser Thr Cys Thr Leu Ser Lys Ala 

260 265 270 

Glu Asn Gin Asp Val He Ser Glu He Leu Asn Ser His Gly Asn 

275 280 285 

He Met Pro Met Asp He Lys Gly He Ala Arg Thr Cys Ser His 

290 295 300 

Asp Phe Thr Phe Ala Pro Thr Gly Gin Glu Cys Gly Leu Leu Val 

305 310 315 

He Pro Asp Lys Gly Lys Ala Trp Gly Pro Met Tyr Val Ala Lys 

320 3 25 33 0 
Leu Lys Lys Ser Trp Ser Thr Gly Lys Trp 

335 3 4o 

<210> 3 
<211> 418 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1305252CD1 

<400> 3 
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Met 


Leu 


Tvr 


Leu 


Glu 


Asp. 


Tyr 


Leu 


Glu 


Met 


He 


Glu 


Gin 


Leu 


Pro 


1 






5 










10 










15 


Met 


Asp 


Leu 




Asp 


Arcr 


Phe 


Thr 


Glu 


Met 


Arg 


Glu 


Met 


Asp 


Leu 








20 










25 










30 


Gin *Val 


Gin 


Asn 


Ala 


Met 


ASD 


Gin 


Leu 


Glu 


Gin 


Arg 


Val 


Ser 


Glu 










35 










40 










45 


Phe- 


Phe 




Asn 


Ala 
50 


Lys 


Lys 


Asn 


Lvs 


Pro 

55- 


Glu 


Trp 


Arg 


Glu 


Glu 
60 


Gln ■ 


Met. 


t\±. a 


OCI 


lie 
65 


Lys 


Lys 


Asp 


Tvr 


Tyr 
70 


Lys 


Ala 


Leu 


Glu 


Asp 
75 


Ala 


Asp 


a*] 1 1 




Val 


Gin 


Leu 


Ala 


Asn 


Gin 


lie 


Tyr 


Asp 


Leu 


Val 






80 










85 










90 


Asp 


Arg 


mo 


LlCU 


Arg Lys ; 


Leu 


Asp 


Gin 


Glu 


Leu 


Ala 


Lys . 


Phe 


Lys 






95 










100 










105 


Met 


Glu 


Leu 


pin 


Ala 
110 


Asp 


Asn 


Ala 


Glv 


He 
115 


Thr 


Glu 


lie 


Leu 


Glu 
120 


Arg 


Arg 


oci 


Leu 


Glu 


Leu. 


Asp 


Thr 


Pro 


Ser 


Gin- 


Pro 


Val 


Asn 


Asn 






12 5.- 










130 










135 


His 


His 


r\ _L ci 


His 


Set 


His 


Thr 


Pro 


Val 


Glu 


Lys 


Arg. 


Lys 


Tyr 


Asn 








140 










145 










150 


Pro 


Thr 


Ser 


nib 


His 
155 


Thr 


Thr 


Thr 


Asp 


His 
160 


lie 


Pro 


Glu 


Lys 


Lys 
165 


Phe 


Lys 


Ser 


nil ii 


Ala 


Leu 


Leu 


Ser 


Thr 


Leu 


Thr 


Ser 


Asp 


Ala 


Ser 








170 










175 










180 


Lys 


Glu 


Asn 


TVit* 

1 Hi 


Leu 


Gly 






Asn 


Asn 


Asn 


Ser 


Thr 


Ala 


Ser 








185 










190 










195 


Ser 


Asn 


Asn 


Ala 


Tyr 
200 


Asn 


Val 


Asn 


Ser 


Ser 
205 


Gin 


Pro 


Leu 


Gly 


Ser 
210 


Tyr 


Asn 


He 


uiy 


Ser 


Leu 


Ser 


Ser 


Glv 


Thr 


Gly Ala 


Gly Ala 


lie 








215 










220 










225 


Thr 


Met 


Ala 


Ala 


Ala 
230 


Gin 


Ala 


Val 


Gin 


Ala 
235 


Thr 


Ala 


Gin 


Met 


Lys 
240 


Glu 


Gly Arg 


Arg 


Thr 


Ser 


Ser 


Leu 


Lys 


Ala 


Ser 


Tyr 


Glu 


Ala 


Phe 










245 










250 










255 


Lys 


Asn 


Asn 


Asp 


Phe 


Gin 


Leu 


Glv 


Lys 


Glu 


Phe 


Ser 


Met 


Ala 


Arg 






260 










265 










270 


Glu 


Thr 


Val 


Gly Tyr 


Ser 


Ser 


Ser 


Ser 


Ala 


Leu 


Met 


Thr 


Thr 


Leu 










275 










280 










285 


Thr 


Gin 


Asn 


Ala 


Ser 
290 


Ser 


Ser 


Ala 


Ala 


Asp 
295 


Ser 


Arg 


Ser 


Gly 


Arg 
300 


Lys 


Ser 


Lys 


Asn 


Asn 


Asn 


Lys 


Ser 


Ser 


Ser 


Gin 


Gin 


Ser 


Ser 


Ser 






305 










310 










315 


Ser 


Ser 


Ser 


Ser 


Ser 
320 


Ser 


Leu 


Ser 


Ser 


Cys 
325 


Ser 


Ser 


Ser 


Ser 


Thr 
330 


Val 


Val 


Gin 


Glu 


He 
335 


Ser 


Gin 


Gin 


Thr 


Thr 
340 


Val 


Val 


Pro 


Glu 


Ser 
345 


Asp 


Ser 


Asn 


Ser 


Gin 


Val 


Asp 


Trp 


Thr 


Tyr 


Asp 


Pro 


Asn 


Glu 


Pro 








350 










355 










360 


Arg 


Tyr 


Cys 


He 


Cys 


Asn 


Gin 


Val 


Ser 


Tyr 


Gly 


Glu 


Met 


Val 


Gly 






365 










370 










375 


Cys 


Asp 


Asn 


Gin 


Asp 


Cys 


Pro 


He 


Glu 


Trp 


Phe 


His 


Tyr 


Gly 


Cys 






380 










385 










390 


Val 


Gly 


Leu 


Thr 


Glu 


Ala 


Pro 


Lys 


Gly 


Lys 


Trp 


Tyr 


Cys 


Pro 


Gin 








395 










400 










405 


Cys 


Thr 


Ala 


Ala 


Met 


Lys 


Arg 


Arg 


Gly 


Ser 


Arg 


His 


Lys 












410 










415 













<210> 4 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
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<223> Incyte ID No: 1416289CD1 
<400> 4 

Met Ala Tyr Asn Val lie lie He Tyr Phe Asn Phe Arg Cys Leu 

5: 10 • 15 

Glu Trp Leu Leu Asn Asn Leu Met >hr His Gin Asn Val Glu Leu 
20 25 . . ' 30 

Phe Lys Glu Leu Ser He Asn Val Met Lys Gin Leu lie Gly Ser 
; 35 " 40 t . - * 45 

s Ser Asn Leu Phe Val Met Gin Val Glu Met Asp He Tyr Thr Ala 
50 55 i ' 60 

; Leu Lys Lys Trp Met Phe Leu Gin Leu Val Pro Ser Trp^Asn Gly 
.65 , 70 * ' " >• 75 

Ser Leu Lys Gin Leu Leu Thr Glu Thr Asp Val Trp Phe Ser '-Lys 
80 ■ ' 85 90 

Gin Arg Lys Asp Phe Glu rGly Jtfet Ala Phe Leu Glu Thr Glu Gin 
95 f * ' 100 x ~ 105 

Gly Lys Pro Phe Val Ser / Val , Phe Arg His Leu Arg- Leu" Gin Tyr 
v HO H5 !20 

He He Ser Asp Leu Ala >Ser Ala Arg He He Glu Gin Asp Ala 
125 130 135 

Val Val Pro Ser Glu Trp Leu Ser Ser Val Tyr Lys Gin Gin Trp 
140 145 150 

Phe Ala Met Leu Arg Ala Glu Gin Asp Ser Glu Val Gly Pro Gin 
155 160 165 

Glu He Asn Lys Glu Glu Leu Glu Gly Asn Ser Met Arg Cys Glv 
170 175 18 Q 

Arg Lys Leu Ala Lys Asp Gly Glu Tyr Cys Trp Arg Trp Thr Gly 
185 190 195 

Phe Asn Phe Gly Phe Asp Leu Leu Val Thr Tyr Thr Asn Ara Tyr 
200 205 210 

He He Phe Lys Arg Asn Thr Leu Asn Gin Pro Cys Ser Gly Ser 

215 220 
Val Ser Leu Gin Pro Arg Arg Ser He Ala Phe Arg Leu Arg Leu 

230 235 240 

Ala Ser Phe Asp Ser Ser Gly Lys Leu He Cys Ser Arg Thr Thr 

245 250 255 

Gly Tyr Gin He Leu Thr Leu Glu Lys Asp Gin Glu Gin Va^ Val 

260 265 270 

Met Asn Leu Asp Ser Arg Leu Leu He Phe Pro Leu Tyr He Cys 

275 280 285 

Cys Asn Phe Leu Tyr He Ser Pro Glu Lys Lys Asn 

290 295 

<210> 5 
<211> 184 
<212> PRT 

<2 1 3 > Homo sapi ens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1558289CD1 

<400> 5 

Met Glu Ser Phe Ser Ser Lys Ser I 

1 5 
Lys Leu Leu Ser Lys Met Ala Gly a 
20 

He Asp Glu Thr Ser Ser Glu Val L 
35 

Ser Lys Glu Tyr Thr His Ser Arg P 
50 

Lys Asp Leu He Lys Val Ala He L 
65 
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Ala 


Leu 


Gin 


Ala 


Glu 


Lys 


10 










15 


Ser 


Val 


Ala 


His 


Leu 


Phe 


25 










30 


Asp 


Glu 


Leu 


Tyr 


Arg 


Val 


. 40 








45 


Gin 


Ala 


Gin 


Arg 


Val 


He 


55 










60 


Val 


Ala 


Val 


Leu 


His 


Arg 


70 










75 
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Asn 


Gly 


Ser 


Phe 


Gly 


Pro 


Ser 


Glu 


Leu 


Ala 


Leu 


Ala 


Thr 


Arg 


Phe 










80 










85 










90 


Arg 


Gin 


Lys 


Leu 


Arg 


Gin 


Gly 


Ala 


Met 


Thr 


Ala 


Leu 


Ser 


Phe 


Gly 










95 










100 










105 


Glu 


Val 


Asp 


Phe. 


Thr 


Phe 


Glu 


Ala 


Ala 


Val 


Leu 


Ala 


Gly 


Leu 


Leu 










110 










115 










120 


Thr 


Glu 


Cys 


Arg 


Asp 


Val 


Leu 


Leu 


Glu 


Leu 


Val 


Glu 


His 


His 


Leu 










125 










130 










135 


Thr 


Pro 


Lys 


Ser 


His 


Gly 


Arg 


He 


Arg 


His 


Val 


Phe 


Asp 


His 


Phe 










14 0 










145 










150 


•Ser 


Asp 


Pro 


Gly 


Leu 


Leu 


Thr 


Ala 


Leu 


Tyr 


Gly 


Pro 


Asp 


Phe 


Thr 










155. 










160 










165 


Gin 


His 


Leu 


Gly 


Lys 


lie 


Cys 


Asp 


Gly 


Leu 


Arg 


Lys^ 


Leu 


Leu 


Asp 










170 










175 










180 


Glu 


Gly 


Lys 


Leu 
























'*210> .6 




























<2'11> 173 


























<212> PRT 


























<213> Homo 


sapiens 






















<220> 




























<221> misc_ 


feature 






















<223> Incyte ID 


No: 


1577739CD1 
















<400> 6 




























Met 


Asp 


Val 


Arg 


Arg 


Val 


Leu 


Val 


Lys 


Ala 


Glu 


Met 


Glu 


Lys 


Phe 


1 








5 










10 










15 


Leu 


Gin 


Asn 


Lys 


Glu 


Leu 


Phe 


Ser 


Ser 


Leu 


Lys 


Lys 


Glv 


Lys 


He 










20 










25 










30 


Cys 


Cys 


Cys 


Cys 


Arg 


Ala 


Lys 


Phe 


Pro 


Leu 


Phe 


Ser 




Pro 


Pro 










35 










40 










45 


Ser 


Cys 


Leu 


Phe 


Cys 


Lys 


Arg 


Ala 


Val 


Cys 


Thr 


Ser 


Cys 


Ser 


He 










50 










55 










60 


Lys 


Met 


Lys 


Met 


Pro 


Ser 


Lys 


Lys 


Phe 


Glv 


His 


He 


Pro 


Val 


Tvr 










65 










70 










75 


Thr 


Leu 


Gly 


Phe 


Glu 


Ser 


Pro 


Gin 


Aro 


Val 


Ser 


Ala 


Ala 


Lys 


Thr 










80 










85 










90 


Ala 


Pro 


He 


Gin 


Arg 


Arg 


Asp 


He 


Phe 


Gin 


Ser 


Leu 


Gin 


Glv 


Pro 










95 










100 










105 


Gin 


Trp 


Gin 


Ser 


Val 


Glu 


Glu 


Ala 


Phe 


Pro 


His 


He 


Tvr 


Ser 


His 










110 










115 










120 


Gly 


Cys 


Val 


Leu 


Lys 


Asp 


Val 


Cys 


Ser 


Glu 


Cvs 


Thr 


Ser 


Phe 


Val 










125 










130 










13 5 


Ala 


Asp 


Val 


Val 


Arg 


Ser 


Ser 


Arg 


Lys 


Ser 


Val 


Asp 


Val 


Leu 


Asn 










140 










145 










150 


Thr 


Thr 


Pro 


Arg 


Arg 


Ser 


Arg 


Gin 


Thr 


Gin 


Ser 


Leu 


Tyr 


He 


Pro 
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feature 






















<223> Incyte ID 
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Val 
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Val 


Ala 


Val 
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Ala 


Ala 


Val 
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Pro 
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1 
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10 
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He 


Asn 


Ser 
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Phe 


Ser 
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Leu 


Arg 


Glu 


He 


Lys 


Lys 


Gin 


Leu 
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20 










Leu 


Leu 
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Ala 


Gly 


Leu 


Thr 


Arg 


Glu 










35 
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Lys 


m 

Trp 
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Ala 


Glu 


Leu 


Ala 


Phe 










- 50 










v Leu 


Ala 


Glu 


Leu 


Gin 


Pro 




Pro 


Pro 
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Gin 


Asp 


Met 


Asp Ala Tyr Thr Leu # Ala 










~ 80 










Lys 


Glu 


Tyr 
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Arg 


Ala 


Ala 


His 


Phe 
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Lys 


Lys 


Ala 


Tyr 


(Phe 


Leu 


Tyr 


Met 


Tyr 










110 


• 








Glu 


Lys 


Lys 


Lys 


Asp Asp 


"Glu 


Thr 


Val 
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Glu 


Lys 


Gly 


Gin 


Val 


Lys 


Asn 


Glu 


Ala 




















Glu 


Leu 


Ser 


Lys 


Lys 


His 


Gin 


Ala 


Arg 










155 




i 






Leu 


Tyr 


Leu 


Tyr Gly Val 


Val 


Leu 


Arg 










170 










Glu 


Ala 


He 


Asp 


Val 


Phe 


Val 


Glu 


Ala 










185 










His 


Trp 


Gly Ala 


Trp 


Leu 


Glu 


Leu 


Cys 










200 










Glu 


Met 


Leu 


Lys 


Phe 


Leu 


Ser 


Leu 


Pro 










215 










Phe 


Phe 


Leu 


Ala 


His 


He 


Tyr 


Thr 


Glu 










230 










Ala 


Leu 


Gin 


Lys 


Tyr 


Gin 


Asn 


Leu 


He 










245 










Ser 


Ser 


Tyr 


He 


Val 


Ser 


Gin 


He 


Ala 










260 










Arg 


Asp 


He 


Asp 


Lys 


Ala 


Leu 


Ser 


He 










275 










Gin 


Asp 


Pro 


Tyr 


Arg 


He 


Glu 


Asn 


Met 










290 










Leu 


Tyr 


Val 


Arg 


Ser 


Met 


Lys 


Ser 


Glu 










305 
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Leu 


Cys 


Glu 


He 
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Arg 
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lie 
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Leu 
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Ser 










335 
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Tyr 
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Arg 


Ala 
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Leu 










350 










Ala 


Trp 


Thr 


Leu 


Met 


Gly His 


Glu 


Tyr 










365 








Ser 


Ala 


Ala 


He 


Gin 


Ala 


Tyr 


Arg 


His 










380 










Arg 


Asp 


Tyr 


Arg 


Ala 


Trp 


Tyr 


Gly 


Leu 










395 










Leu 


Lys 


Met 


Pro 


Phe 


Tyr 


Cys 


Leu 


Tyr 










410 












Leu 


Arg 


Pro 


Asn 


Asp 


Ser 


Arg 


Met 










425 










Cys 


Tyr 


Glu 


Lys 


Leu 


Asn 


Gin 


Leu 


Val 










440 










Trp 


Arg 


Ala 


Tyr 


Ala 


Val 


Gly 


Asp 


Val 










455 










Lys 


Leu 


Ala 


Lys 


Leu 


His 


Glu 


Gin 


Leu 










470 










Ala 


Gin 


Cys 


Tyr 


He 


Lys 


Tyr 


He 


Gin 










485 
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25 










30 


Arg Gly 


Leu 


Leu 


His 


Ser 


40 










45 5 


Ser 


Leu 


Pro 


Ala 


Leu 


Pro 


55 










60 


He 


Thr 


Giu 


Glu 


Asp 


Ala 


70 










75 


Lys t Ma Tyr Phe Asp Val 


85 










90 


Leu 


His 


Gly 


Cys 


Asn 


Ser 


100 










105 


Ser 


Arg 


Tyr 


Leu 


Ser 


Gly - 


iis 










120 


Asp' Ser I^eu Gly 


Pro 


Leu 


130 










.135 


Leu 


Arg 


Glu 


Leu 


Arg 


Val 


ii& 










150 


Glu 


Leu 


Asp Gly 


Phe 


Gly 


160 










165 


Lys 


Leu 


Asp 


Leu 


Val 


Lys 


175 










180 


Thr 


His 


Val 


Leu 


Pro 


Leu 


190 










195 


Asn 


Leu 


He 


Thr 


Asp 


Lys 


205 










210 


Asp 


Thr 


Trp 


Met 


Lys 


Glu 


220 










225 


Leu 


Gin 


Leu 


He 


Glu 


Glu 


235 










240 


Asp 


Val 


Gly 


Phe 


Ser 


Lys 


250 










255 


Val 


Ala 


Tyr 


His 


Asn 


He 


265 










270 


Phe 


Asn 


Glu 


Leu 


Arg 


Lys 


280 










285 


Asp 


Thr 


Phe 


Ser 


Asn 


Leu 


295 










300 


Leu 


Ser 


Tyr 


Leu 


Ala 


His 


310 










315 


val 


Glu 


Thr 


Cys 


Cys 


Val 


325 










330 


Gin 


His 


Glu 


Lys 


Ala 


Ala 


340 










345 


Asn 


Pro 


Arg 


Tyr 


Leu 


Gly 


355 










360 


Met 


Glu 


Met 


Lys 


Asn 


Thr 


370 










375 


Ala 


He 


Glu 


Val 


Asn 


Lys 


385 










390 


Gly 


Gin 


Thr 


Tyr 


Glu 


He 


400 










405 


Tyr 


Cys 


Arg 


Arg 


Ala 


His 


415 










420 


Leu 


Val 


Ala 


Leu 


Gly 


Glu 


430 










435 


Glu 


Ala 


Lys 


Lys 


Cys 


Tyr 


445 










450 


Glu 


Lys 


Met 


Ala 


Leu 


Val 


460 










465 


Thr 


Glu 


Ser 


Glu 


Gin 


Ala 


475 










480 


Asp 


He 


Tyr 


Ser 


Cys 


Gly 


490 










495 
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Glu 


lie 


Val 


Glu 


His 
500 


Leu 


Glu 


Glu 


Ala 


Gin 


Tyr 


Tyr 


Phe 
515 


Lys 


Cys 


Lys 


Cys 


Ala 


Gin 


Lys 


Cys 
530 


Cys 


Ala 


Phe 


Lvs. 


Ala 


Leu 


Leu 


Arg 


Gin 


He 


Leu 








545 








Thr 


Pro 


Thr 


Thr 


Glu 
560 


Val 


Pro 


Ala 


Leu 


Ser 


Ala 


Asn 


Asn 

575 


Thr 


Pro 


Thr 


Leu 


Ser 


Ser 


val 


Thr 
590 


Pro 







<210> 8 , 
<211> 463 
<212> PRT 

*<213> Homo sapiens 



<220> - 

<221> raisc_feature 

<223> Incyte ID No: 1887228CD1 



Ser 


Thr 
505 


Ala 


Phe 


Arg 


Tyr 


Leu 
510 


Leu 


Trp 
520 


Asp 


Glu 


Ala 


Ser 


Thr 
525 


Asn 


Asp 


Thr 


Arg, 


Glu .Glu 


Gly 




535 










540 


Gin 


Leu 


Arg 


Asn 


Gin 


Gly Glu 




550. 










555 


Pro 


Phe 


Phe 


Leu 


Pro 


Ala 


Ser 




565 










570 


Arg 


Arg 
580 


Val 


Ser 


Pro 


Leu 


Asn 
585 



<400> 8 



Met 


Pro 


Leu 


Leu 


Asn 


Trp 


V Cli. 


Ala 


Leu 


Lys 


Pro 


Ser 


Gin 


He 


Thr 


1 








J 










10 










15 


Glv Thr 


Val 




x Il-L 


Glu 


Leu 


Asn 


Asp 


Glu 


Lys 


Val 


Leu 


Gin 


Glu 










2 0 










25 










30 


Leu 




1 itr L. 




Asp 


Phe 


Glu 


Glu 


Gin 


Phe 


Lvs 


Thr 


Lys 


Ser 


Gin 








3 5 










40 
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Gly 
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Ser 


Ala 
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Lvs 


Ser 


Lys 


Ala 


Ala 


Gin 








50 










55 










60 


Lys 


Ala 
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Lys 


Ala 


Thr 


Leu 


He 


Glu 


Ala 


Asn 


Arg 


Ala 


Lys 








65 










70 
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Leu 
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He 


Thr 


Leu 


Arg 


Lys 


Gly 


Asn 


Leu 
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Ala 


Glu 
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85 
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He 
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Gin 


Ala 


He 


Glu 


Ala 
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Asp 


Leu 


Gin 


Ala 


Leu 
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100 










105 


Asp 
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Leu 


Glu 


Leu 


Leu 


Met 


Arg 


Phe 


Leu 
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Thr 


Glu 


Tyr 


Glu 








110 










115 










120 


Arg 
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Leu 


He 


Thr 


Arg 


Phe 


Glu 


Arg 


Glu 


Gin 


Arg 


Pro 


Met 


Glu 
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Glu 
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Ser 
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Arg 


Phe 


Met 


Leu 
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Phe 


Ser 


Arg 


He 










140 










145 










150 


Pro 


Arg 


Leu 


Pro 


Glu 


Arg 


Met 


Thr 


Thr 


Leu 


Thr 


Phe 


Leu 


Gly 


Asn 








155 










160 










165 


Phe 


Pro 


Asp 


Thr 


Ala 


Gin 


Leu 


Leu 


Met 


Pro 


Gin 


Leu 


Asn 


Ala 


He 








170 










175 










180 


He 


Ala 


Ala 


Ser 


Met 


Ser 


He 


Lys 


Ser 


Ser 


Asp 


Lys 


Leu 


Arg 


Gin 










185 










190 










195 


He 


Leu 


Glu 


He 


Val 


Leu 


Ala 


Phe 


Gly 


Asn 


Tyr 


Met 


Asn 


Ser 


Ser 










200 










205 










210 


Lys 


Arg 


Gly 


Ala 


Ala 


Tyr 


Gly 


Phe 


Arg 


Leu 


Gin 


Ser 


Leu 


Asp 


Ala 






215 










220 










225 


Leu 


Leu 


Glu 


Met 


Lys 


Ser 


Thr 


Asp 


Arg 


Lys 


Gin 


Thr 


Leu 


Leu 


His 










230 










235 










240 


Tyr 
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Val 


Lys 


Val 


He 
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Glu 


Lys 


Tyr 


Pro 


Gin 


Leu 


Thr 


Gly 






245 
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Ala 
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Ser 


Val 


Ser 
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Val 
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Val 


Arg 
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Leu 
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Met 


Val 
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290 295 300 

Leu Lys Glu Phe Leu Arg Ala Asn Ser Pro Thr Met Asp Lys Leu 

305 310 315 

Leu Ala Asp Ser Lys Thr Ala Gin Glu Ala Phe Glu Ser Val Val 1 

320 ' 325 , 330 . z 

Glu Tyr Phe Gly Glu Asn t>ro,-Lys Thr Thr Ser Pro Gly Leu Phe 

335 340 ' 345 . 

Phe Ser Leu Phe Ser Arg Phe fie Lys Ala Tyr Lys -Lys A*ia Glu 

3 ?0 ..355 % 3 } Q{ : 

Gin Glu Val,. Glu- Gin Trp Lys Lys Glu Ala -Ala Ala Gin Glu Ala V N 

3 65 v , 370 - 375 

Gly Ala Asp Thr Pro Gly . Lys Gly Glu Pro Pro Ala vPro Lys Ser ' v 

380 ' ' . - 385 ' ' . ' 390 , r 

Pro Pro Lys,. Ala Arg Arg Pro .Gin Met, Asp; Leu He Ser Glu Leu \ 

395 t 400 . - 405 

Lys Arg Arg, Gin ' Gin Lys Glu Pro Leu He Tyr Glu Ser Asp Arg 

41 P . J 415 t 420 - 

Asp Gly Ala He. Glu Asp He He Thr Asp , Leu Arg Asn Gin Pro t 

425 * 430 * - 435 , * 

Tyr He Arg. Ala Asp Thr Gly Arg Arg >Ser Ala Arg Arg Arg Pro ' 

440 445 450 

Pro Gly Pro Pro Leu Gin Val Thr Ser Asp Leu Ser Leu 

455 460 

<210> 9 
<211> 270 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1988468CD1 
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Val 
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Pro 
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Arg 


Phe 


Asn 
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Ser 
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Gin 
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Ser 


Leu 
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Asn 


Gin 
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Phe 


Asn 
125 


His 


His 


Pro 


Pro 


Asp 


Leu 


His 


Pro 
140 


Ala 
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Gly 


Gin 


His 


Phe 


Arg 
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155 
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Asn 


Pro 


Thr 


Pro 


Gly Gly 


Ser 
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Ser 
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Ser 


Ser 


Ser 
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Gly 
185 
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Ala 


Gly 


Ser 
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Asn 
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Met 
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Met 
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Pro 


Pro 
215 


Asn 


Val 


He 



Asn 


His 
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Arg 
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Pro 


Asp 
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Pro 


Ala 


His 


Arg 


Met 


Gly 


Met 




25 
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His 


Gin 


Gin 


Gin 


Gin 


Pro 


Gin 




40 










45 
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His 


He 


His 
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Gly 
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He 
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Met 
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Pro 
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Ala 
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Met 
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Met 
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120 
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Met 
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Asn 
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Ser 
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Glu Val Leu Met Ser Leu Val lie 
230 

Lys Glu Leu Pro Glu Leu Trp Leu 
245 

Met Thr Asp Phe Val Cys Lys Gin 

. 260 . 

<210> 10 < 

<211> 255 

<212> PRT 

<2x3> Homo sapiens 

<220> 

<221> misc_f eature 
<223> ; Incyte ID No: 2049176CD1 

<400> 10 

Met Val Ser Trp Met lie Ser Arg 

Met Leu Tyr Pro Ala Tyr Tyr Ser 
20 

Asn Val Lys Glu Tyr Val Arg Trp 
35 

Ala Leu Tyr Thr Val lie Glu Thr 
50 

Trp Phe Pro Leu Tyr Tyr Glu Leu 
65 

Leu Leu Ser Pro Tyr Thr Lys Gly 
80 

Phe Leu His Pro Leu Leu Ser Ser 
95 

Tyr lie Val Gin Ala Lys Glu Arg 
110 

Phe Gly Arg Gin Gly Leu Asn Leu 
125 

Ala Ala Val Lys Ser Gin Gly Ala 
140 

Phe Ser Met His Asp Leu Thr Thr 
155 

Gly Gin Arg Pro Tyr Gin Pro Leu 
170 

Lys Pro Ala Pro Ser Glu Ser Ala 
185 

Asp Gly Asp Glu Lys Thr Asp Glu 

200 

Asp Asn Glu Met Leu Thr His Lys 
215 

Met Lys Ser Val Lys Thr Thr Lys 
230 

Gly Ser Leu Lys Tyr Lys Val Lys 
245 



Glu Met Gly Leu Asp Arg lie 
235 240 

Gly Gin Asn Glu Phe Asp Phe 
250 255 

Gin Pro Ser Arg Val Ser Cys 
265 270 



A 1 A 
AX C* 


Val 


Val 


Leu 


Val 


Phe 


Glv 




X VJ , 










15 


Tyr 


T a/c 
i_jy o 


Ala 


val 


xjy o 


Thr 


Lys 




25 










30 


Met 




Tyr 


Trp 


He 


Val 


Phe 




40 










45 


V CI X 


Ala 


Asp 


Gin 


Thr 


Val 


Ala 














60 


Lys 


He 


Ala 


Phe 


Val 


He 


Trp 


70 










75 


Ala 


Ser 


Leu 


He 


Tyr 


Arg 


Lys 




85 










90 


Lys 


Glu 


Arg 


Glu 


He 


Asp 


Asp 


100 










105 


Gly 


Tyr 


Glu 


Thr 


Met 


Val 


Asn 




115 










120 


Ala 


Ala 


Thr 


Ala 


Ala 


Val 


Thr 




130 










135 


He 


Thr 


Glu 


Arg 


Leu 


Arg 


Ser 




145 










150 


He 


Gin 


Gly Asp 


Glu 


Pro 


Val 




160 










165 


Pro 


Glu 


Ala 


Lys 


Lys 


Lys 


Ser 




175 










180 


Gly 


Tyr 


Gly 


.He 


Pro 


Leu 


Lys 




190 










195 


Glu 


Ala 


Glu 


Gly 


Pro 


Tyr 


Ser 




205 










210 


Gly 


Leu 


Arg 


Arg 


Ser 


Gin 


Ser 


220 










225 


Gly 


Arg 


Lys 


Glu 


Val 


Arg 


Tyr 




235 










240 


Lys 


Arg 


Pro 


Gin 


Val 


Tyr 


Phe 


250 










255 



<210> 11 

<211> 533 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2686765CD1 

<400> 11 

Met Ser Gly Thr Leu Glu Ser Leu Ala Asp Asp Val Ser Ser Met 
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1 5 io i 5 

Gly Ser Asp Ser Glu He Asn Gly Leu Ala Leu Arg Lys Thr Asp 
20 ; 25 30 

Lys Tyr Gly - Phe Leu Gly Gly Ser Gin Tyr Ser Gly Ser Leu Glu 
' 35 - , « 40. 1 . ■ *. 45 

Ser Ser lie Pro Val Asp Val Ala Arg Gin Arg Glu Leu Lys Trp 

60 ■ • 55 60 ^ v 

^Leu' Asp Met Phe Ser Asn vTrp.Asp Lys Trp Leu Ser Arg Arg Phe ' 

65 / ; 70 i '75 

Gin Lys Val Lys Leu Arg Cys Arg Lys Gly He Pro Ser iSer Leu 
* , > 80 * f. 85 f ' 1 90 

Arg Ala Lys Ala Trp Gin Tyr. Leu Ser Asn Ser Lys Glu: Leu Leu 
95 ' 100 r' 1 105 

Glu Gin Asn Pro Gly Lys Phe Glu £31u Leu Glu Arg Ala Pro ■ Gly 
- 110 ' ' H5, 120 

Asp Pro Lys Trp Leu. .Asp Val He Glu Lys Asp Leu His Arg Gin . ' 

12 5 ' . 131Q . - 125 X 

t Phe Pro Phe His Glu Met Phe Ala Ala Arg Gly Gly His Gly Gin : 

" 140 * 145 t * , : 150 . * 

Gin Asp ;,Leu Tyr Arg He Leu Lys Ala Tyr Thr He Tyr Arg Pro 

* 155 160 i 6 5 \ 

Asp Glu Gly Tyr Cys Gin Ala Gin Ala Pro Val Ala Ala -Val Leu 
170 175 iso 

Leu Met His Met Pro Ala Glu Lys Pro Phe Gly Ala Trp Val Gin 
185 190 195 

He Cys Asp Lys Tyr Leu Pro Gly Tyr Tyr Ser Ala Gly Leu Glu 
200 205 210 

Ala He Gin Leu Asp Gly Glu He Phe Phe Ala Leu Leu Arg Arg 
215 220 225 

Ala Ser Pro Leu Ala His Arg His Leu Gin Arg Gin Arg He Asp 
230 235 240 

Pro Val Leu Tyr Met Thr Glu Trp Phe Met Cys He Phe Ala Arq 
245 250 255 

Thr Leu Pro Trp Ala Ser Val Leu Arg Val Trp Asp Met Phe Phe 
260 265 270 

Cys Glu Gly Val Lys He He Phe Arg Val Ala Leu Val Leu Leu 
275 280 285 

Arg His Thr Leu Gly Ser Val Glu Lys Leu Arg Ser Cys Gin Gly 
290 295 300 

Met Tyr Glu Thr Met Glu Gin Leu Arg Asn Leu Pro Gin Gin Cys 

305 310 315 i 

Met Gin Glu Asp Phe Leu Val His Glu Val Thr Asn Leu Pro Val 
320 325 330 

Thr Glu Ala Leu He Glu Arg Glu Asn Ala Ala Gin Leu Lys Lys 
335 340 345 

Trp Arg Glu Thr Arg Gly Glu Leu Gin Tyr Arg Pro Ser Arg Arg 
350 355 360 

Leu His Gly Ser Arg Ala He His Glu Glu Arg Arg- Arg Gin Gin 
365 370 375 

Pro Pro Leu Gly Pro Ser Ser Ser Leu Leu Ser Leu Pro Gly Leu 
380 385 390 

Lys Ser Arg Gly Ser Arg Ala Ala Gly Gly Ala Pro Ser Pro Pro 
395 400 405 

Pro Pro Val Arg Arg Ala Ser Ala Gly Pro Ala Pro Gly Pro Val 
410 415 420 

Val Thr Ala Glu Gly Leu His Pro Ser Leu Pro Ser Pro Thr Gly 
425 430 435 

Asn Ser Thr Pro Leu Gly Ser Ser Lys Glu Thr Arg Lys Gin Glu 
440 445 45Q 

Lys Glu Arg Gin Lys Gin Glu Lys Glu Arg Gin Lys Gin Glu Lys 
455 460 465 

Glu Arg Glu Lys Glu Arg Gin Lys Gin Glu Lys Glu Arg Glu Lys 
470 475 480 
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Gin 


Glu 


Lys 


Glu Arg 
485 


pi ^ 

(j-LU 


Lys 


La xli 


Glu 


Lys 


Lys 


Ala Gin 
500 


Gly 


Arg 


Lys 


Asp 


Gly 


Pro 


Pro Gly 
515 


Pro 


His 


Asp 


Glu 


Ala 


Arg 


Gin Asp 
530 


Ala 


Tyr 


Phe 



<210> 12 

<211> 160 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eacure 

<223> Incyte ID No: 3215187CD1 

<400> 12 

Met Ala Phe Thr Phe Ala Ala Phe_ 

1 * * 5 
Leu Cys Ala Ala Leu lie Phe Phe 

20 

Phe Asp Glu Leu Arg Thr Asp Phe 
35 

Asn Pro Val His Ala Arg Glu Arg 
50 

Cys Phe Leu Leu Arg Lys Leu Val 
65 

Ser Leu Phe Cys lie Met Phe Leu 
80 

Leu Gly Leu Asn Val Pro Leu Leu 
95 

Phe His Cys Pro Ala Asp Ser Ser 
110 

Val Val Met Asn Ala Asp Thr Leu 
125 

Trp Cys Lys Leu Ala Phe Tyr Leu 
140 

Tyr Cys Met lie Tyr Thr Leu Val 
155 

<210> 13 

<211> 531 

<212> PRT 

<2 13 > Homo sapiens 



Glu Lys Glu Arg Gin Lys Gin 
490 495 

Leu Ser Leu Arg Arg Lys Ala 
505 510 

Gly Gly Asp Arg Pro Ser Ala 
520 525 



Cys 


Tyr 
10 


Met 


Leu 


Ser 


Leu 


Val 
15 


Ala 


lie 
25 


Trp 


His 


He 


He 


Ala 
30 


Lys 


Ser 
40 


Pro 


He 


Asp 


Gin 


Cys 
45 


Leu 


Arg 

55 


Asn 


He 


Glu 


Arg 


He 
60 


Leu 


Pro 
70 


Glu 


Tyr 


Ser 


He 


His 
75 


Cys 


Ala 


Gin 


Glu 


Trp 


Leu 


Thr 


85 










90 


Phe 


Tyr 
100 


His 


Phe 


Trp 


Arg 


Tyr 
105 


Glu 


Leu 
115 


Ala 


Tyr 


Asp 


Pro 


Pro 
120 


Ser 


Tyr 
130 


Cys 


Gin 


Lys 


Glu 


Ala 
135 


Leu 


Ser 
145 


Phe 


Phe 


Tyr 


Tyr 


Leu 
150 


Ser 


Ser 
160 













<220> 

<221> misc_f eature 

<223> Incyte ID No: 3500375CD1 

<400> 13 



Met 


Ala 


Asp 


Val 


Leu 


Ser 


Val 


Leu 


1 








5 








Lys 


Glu 


He 


Val 


Val 


Lys 


Gly 


Asp 










20 








Ser 


Trp 


Pro 


Lys 


Asn 


Val 


Lys 


Thr 










35 








Gly 


Lys 


Glu 


Gly 


Gin 


Pro 


Arg 


Glu 










50 








Leu 


Phe 


Leu 


Leu 


Asn 


Asn 


Val 


His 










65 








Arg 


Arg 


Ala 


Ala 


Thr 


Glu 


Asn 


He 










80 








Arg 


Lys 


Asp 


Leu 


Leu 


Gly 


Tyr 


Leu 



Arg 


Gin 


Tyr 


Asn 


He 


Gin 


Lys 




10 










15 


Glu 


Val 


He 


Phe 


Gly 


Glu 


Phe 




25 










30 


Asn 


Tyr 


Val 


Val 


Trp 


Gly 


Thr 




40 










45 


Tyr 


Tyr 


Thr 


Leu 


Asp 


Ser 


He 




55 










60 


Leu 


Ser 


His 


Pro 


Val 


Tyr 


Val 




70 










75 


Pro 


Val 


Val 


Arg 


Arg 


Pro 


Asp 




85 










90 


Asn 


Gly 


Glu 


Ala 


Ser 


Thr 


Ser 
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Arg. Ser Phe ^Val Asp Ala Glu VaJ. Asp Yal Thr Arg Asp He Val 

215 _ , _ 220 



WO 01/07471 

PCT/US00/19948 

95 100 105 

Ala Ser He Asp Arg Ser Ala Pro Leu Glu He Gly Leu Gin Arg 

110 US ion s 

Ser Thr Gin Val Lys Arg Ala Ala Asp Glu Val Leu. Ala Glu Ala 
125 . ; 130 135 

Lys Lys Pro Arg lie Glu Asp Glu Glu Cys Val' Arg Leu Asp Lys ' 

140 145 X * Q 

Glu Arg Leu Ala Ala Arg "Leu Glu Gly His^Lys Glu dly He Val 

I 55 160 165 " 

Gin Thr Glu Gin He: Arg Ser Leu Ser Qlu Ala Met Ser Val Glu 

I 70 , * 175 t 1 go V 

Lys He Ala Ala He Lys. Ala Lys He, Met Ala Lys Lys' Arg Ser ' - 

185 i9o' i 95 

Thr He Lys Thr Asp Leu Asp Asp Asp He Thr Ala Leu Lys Gin 
200 f . 205 f 210 

Val 

Ser Arg Glu Arg Val Trp Arg Thr Arg Thr *hr lie Leu Gin Ser 

Thr Gly Lys Asn Phe Ser Lys Asn tie Phe Ala He Leu Gin Ser 

245 250 255 

Val Lys Ala Arg Glu Glu Gly Arg Ala Pro Glu Gin 'Arg Pro Ala 

260 265 270 

Pro Asn Ala Ala Pro Val Asp Pro Thr Leu Arg Thr Lys Gin Pro 

275 280 285 

He Pro Ala Ala Tyr Asn Arg Tyr Asp Gin Glu Arg Phe Lys Gly 

290 295 300 

Lys Glu Glu Thr Glu Gly Phe Lys He Asp Thr Met Gly Thr Tyr 

305 310 315 

His Gly Met Thr Leu Lys Ser Val Thr Glu Gly Ala Ser Ala Arg 

320 325 33 2 

Lys Thr Gin Thr Pro Ala Ala Gin Pro Val Pro Arg Pro Val Ser 

335 340 345 

Gin Ala Arg Pro Pro Pro Asn Gin Lys Lys Gly Ser Arg Thr Pro 

35 ^ 355 
He He He He Pro Ala Ala Thr Thr Ser Leu He Thr Met Leu 

365 370 375 

Asn Ala Lys Asp Leu Leu Gin Asp Leu Lys Phe Val Pro Ser Asp 

380 385 390 

Glu Lys Lys Lys Gin Gly Cys Gin Arg Glu Asn Glu Thr Leu He 

395 400 405 

Gin Arg Arg Lys Asp Gin Met Gin Pro Gly Gly Thr Ala He Ser 

410 415 420 

Val Thr Val Pro Tyr Arg Val Val Asp Gin Pro Leu Lys Leu Met 

425 430 435 

Pro Gin Asp Trp Asp Arg Val Val Ala Val Phe Val Gin Gly Pro 

440 44 5 450 

Ala Trp Gin Phe Lys Gly Trp Pro Trp Leu Leu Pro Asp Gly Ser 

Pro Val Asp He Phe Ala Lys He Lys Ala Phe His Leu Lys JJr 

470 47 $ 480 

Asp Glu Val Arg Leu Asp Pro Asn Val Gin Lys Trp Asp Val Thr 

Val Leu Glu Leu Ser Tyr His Lys Arg His Leu Asp Arg Pro "l 
500 5 05 510 

Phe Leu Arg Phe Trp Glu Thr Leu Asp Arg Tyr Met Val Lys His 

515 520 5 25 

Lys Ser His Leu Arg Phe 

530 

<210> 14 
<211> 165 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 5080410CD1 

<400> 14 



Met 


Ala 


Ser 


Met 


Arg 


Glu 


Ser 


Asp 


X 








5 








Lys 


Leu Gly 


Ala 


Thr 


Asp 


Glu 


Leu 








20 








Ser 


Leu 


Leu 


Thr 


Ala 
35 


Ala 


Val 


He 


His 


Gly 


Leu 


Ser 


Ser 
50 


Ala 


Val 


Lys 


Leu 


His 


Leu 


Pro 


Arg 

O _J 


Arcr 


Thr 


Val 


Met 


Lys 


Gly 


Ala 


Leu 


Met 


Glu 


He 










- 80 








Ser 


Asp Pro 


Trp 


Val 


Leu 


Met 


Val 










95 








Pro 


Asp 


Thr 


Gly 


Ser 
110 


Leu 


Asn 


Leu 


Asn 


Val 


Gin 


Asp 


He 
125 


Leu 


Gly 


Glu 


Cys 


Glu 


Ala 


Ser 


Ala 


Met 


Leu 


Pro 








140 








Lys 


Asn 


Ala 


Ala 


Asp 


Asp 


Pro 


Arg 








155 









<210> 15 
<211> 199 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc.feature 

<223> Incyte ID No: 5218248CD1 



Thr 


Gly 


Leu- 


Trp 


Leu 


His Asn 




10 








15 


Trp 


Ala 


Pro 


Pro 


Ser 


lie Ala 




25 








30 


Asp 


Asn 


He 


Arg 


Leu 


Cys Phe 




40 








.45 


Leu 


Lys 


Leu 


Leu 


Leu 


Sly Thr 




55 




t 


He 


60 


Asp 


Glu 


His 


Pro 


lieu Pro 


7.0 








75 


He 


Gin 


Leu 


Ala 


Ser 


Leu Asp 




85 








.90 


Ala 


Asp 


He 


Leu 


Lys 


Ser Phe 




100 








- , 105 


Glu 


Leu 


Glu 


Glu 


Gin 


Asn Pro 




115 








120 


Leu 


Arg 


Glu 


Lys 


Val 


Gly Glu 




130 








135 


Leu 


Glu 


Cys 


Gin 


Tyr 


Leu Asn 




145 








150 


Gly 


Thr 


Pro 


His 


Ser 


Pro Gly 


160 








165 



<400> 15 
























Ala 


Ala 


Met 


Ser 


Asn 


Met 


Glu 


Lys 


His 


Leu 


Phe 


Asn 


Leu 


Lys 


Phe 


1 








5 








10 










15 


Lys 


Glu 


Leu 


Ser 


Arg 


Ser 


Ala 


Lys 


Lys 


Cys 


Asp 


Lys 


Glu 


Glu 


Lys 








20 










25 










30 


Ala 


Glu 


Lys 


Ala 


Lys 


He 


Lys 


Lys 


Ala 


He 


Gin 


Lys 


Gly 


Asn 


Met 








35 










40 










45 


Glu 


Val 


Ala 


Arg 


He 


His 


Ala 


Glu 


Asn 


Ala 


He 


Arg 


Gin 


Lys 


Asn 








50 










55 










60 


Gin 


Ala 


Val 


Asn 


Phe 


Leu 


Arg 


Met 


Ser 


Ala 


Arg 


Val 


Asp 


Ala 


Val 








65 










70 










75 


Ala 


Ala 


Arg 


Val 


Gin 


Thr 


Ala 


Val 


Thr 


Met 


Gly 


Lys 


Val 


Thr 


Lys 








80 










85 










90 


Ser 


Met 


Ala 


Gly 


Val 


Val 


Lys 


Ser 


Met 


Asp 


Ala 


Thr 


Leu 


Lys 


Thr 








95 










100 










105 


Met 


Asn 


Leu 


Glu 


Lys 


He 


Ser 


Ala 


Leu 


Met 


Asp 


Lys 


Phe 


Glu 


His 








110 










115 










120 


Gin 


Phe 


Glu 


Thr 


Leu 


Asp 


Val 


Gin 


Thr 


Gin 


Gin 


Met 


Glu 


Asp 


Thr 








125 










130 










135 


Met 


Ser 


Ser 


Thr 


Thr 


Thr 


Leu 


Thr 


Thr 


Pro 


Gin 


Asn 


Gin 


Val 


Asp 








140 










145 










150 


Met 


Leu 


Leu 


Gin 


Glu 


Met 


Ala 


Asp 


Glu 


Ala 


Gly 


Leu 


Asp 


Leu 


Asn 








155 










160 










165 


Met 


Glu 


Leu 


Pro 


Gin 


Gly 


Gin 


Thr 


Gly 


Ser 


Val 


Gly 


Thr 


Ser 


Val 










170 










175 










180 


Ala 


Ser 


Ala 


Glu 


Gin 


Asp 


Glu 


Leu 


Ser 


Gin 


Arg 


Leu 


Ala 


Arg 


Leu 
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185 190 195 



<210> 16 
<211> 168 
<212> PRT 

<213> Homo sapiens * 

? 

<220> / 

<221> 'miscj.feature 

<223> Incyte'lD No: Q58336CD1*' 



<400> 16 . ' 

Met Ala Phe Asn Asp Kys Phe Ser Leu Asn Tyr Pro Gly Asn Pro 

Cys Pro Gly Asp Leu He Glu Val Phe Arg Pro Gly Tyr Gin His 

20 v . . 25 "30 

Trp Ala Leu Tyr Leu Gly AspGly-Tyr Val He Asn He Ala. Pro 

35 i 40 4 c 

Val Asp Gly He Pro Ala Ser Phe Thr Ser Ala Lys Ser Val Phe 

Ser Ser Lys Ala Leu Val Lys Met Gin Leu Leu Lys Asp Val Val 

Gly Asn Asp Thr Tyr Arg He Asn Asn Lys Tyr Asp Glu Thr Tyr 

Pro Pro Leu Pro Val Glu Glu He He Lys Arg Ser Glu Phe Val 
95 1°0 ins 

He Gly Gin Glu Val Ala Tyr Asn Leu Leu Val Asn Asn Cys Glu 

110 115 120 

H 1S Phe Val Thr Leu Leu Arg Tyr Gly Glu Gly Val Ser Glu Gin 

125 130 135 

Ala Asn Arg Ala He Ser Thr Val Glu Phe Val Thr Ala Ala Val 

140 l 4 ^ 150 

Gly Val Phe Ser Phe Leu Gly Leu Phe Pro Lys Gly Gin Arg Ala 

155 160 165 

Lys Tyr Tyr ±0 ° 



<210> 17 

<211> 162 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1511488CD1 

<400> 17 



Met 
1 


Leu 


Arg 


Ala 


Val 


Gly 


Ser 


Leu 


Thr 


Val 


Arg 


Cys 


5 

Gly 


Pro 


Gly 


Ala 










20 








Pro 


Ala 


Pro 


Ala 


Leu 


Pro 


Pro 


Arg 










35 






Gly 


Gly 


Ala 


Pro 


Ser 


Asn 


Ser 


Gly 










50 






His 


Arg 


Val 


Pro 


Thr 


Gin 


Arg 


Arg 










65 




He 


Leu 


Leu 


Trp 


Thr 


Gly 


Arg 


Phe 










80 








Pro 


Arg 


He 


Pro 


Pro 


Glu 


Met 


He 










95 








Arg 


Val 


Lys 


Ala 


Cys 


Tyr 


He 


Met 










110 









Leu 


Arg 


Leu 


Gly Arg 


Gly 


Leu 




10 








15 


Pro 


Leu 


Glu 


Ala 


Thr 


Arg 


Arg 


Gly 


25 










30 


Leu 


Pro 


Cys 


Tyr 


Ser 


Ser 




40 










45 


Pro 


Gin 


Gly 


His 


Gly Glu 


He 




55 










60 


Pro 


Ser 


Gin 


Phe 


Asp 


Lys 


Lys 




70 










75 


Lys 


Ser 


Met 


Glu 


Glu 


He 


Pro 




85 










90 


Asp 


Thr 


Ala 


Arg 


Asn 


Lys 


Ala 




100 








105 


He 


Gly 


Leu 


Thr 


He 


He 


Ala 




115 










120 
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Cvs Phe Ala Val He Val Ser Ala Lys Arg Ala Val Glu Arg His 
y 125 130 135 

Glu Ser Leu Thr Ser Trp Asn Leu Ala Lys Lys Ala Lys Trp Arg 
140 145 ! 150 

Glu Glu Ala Ala Leu Ala : Ala Gin . Ala Lys Ala Lys 

155 > 160 

<210> 18 ■< f 

<211> 246 £ ■ 

<212> PRT ' 
<213> Homo sapiens ^ 

<220> . ' • 

<221> misc_f eature ■ * / t 

<223> Incyte vXD No: 1638819CD1 > . 

<400> 18 " n " . ■ 

Met Ala Gly Tyr Leu Lys Leu Val Cys Val Ser Phe Gin Arg Gin 
1 - 5 10 , r 15 

Glv Phe His Thr Val Gly Ser Arg Cys Lys Asn Arg Thr Gly Ala 
V ' . 20 ' " - 25. *■ " 30 

Glu His Leu Trp Leu Thr * Arg His . Leu Arg Asp Pro/- Phe Val Lys 
35 40 45 

Ala Ala Lys Val Glu Ser; Tyr Arg Cys Arg Ser Ala ^ Phe Lys Leu 
50 55 60 

Leu Glu Val Asn Glu Arg His Gin He Leu Arg Pro Gly Leu Arg 
65 70 75 

Val Leu Asp Cys Gly Ala Ala Pro Gly Ala Trp Ser Gin Val Ala 
80 85 90 

Val Gin Lys Val Asn Ala Ala Gly Thr Asp Pro Ser Ser Pro Val 
95 100 105 

Gly Phe Val Leu Gly Val Asp Leu Leu His He Phe Pro Leu Glu 
HO 115 120 

Gly Ala Thr Phe Leu Cys Pro Ala Asp Val Thr Asp Pro Arg Thr 
125 130 135 

Ser Gin Arg He Leu Glu Val Leu Pro Gly Arg Arg Ala Asp Val 
140 145 150 

He Leu Ser Asp Met Ala Pro Asn Ala Thr Gly Phe Arg Asp Leu 
155 160 165 

Asp His Asp Arg Leu He Ser Leu Cys Leu Thr Leu Leu Ser Val 
170 175 180 

Thr Pro Asp He Leu Gin Pro Gly Gly Thr Phe Leu Cys Lys Thr 
185 190 195 

Trp Ala Gly Ser Gin Ser Arg Arg Leu Gin Arg Arg Leu Thr Glu 
200 205 210 

Glu Phe Gin Asn Val Arg He He Lys Pro Glu Ala Ser Arg Lys 
215 220 225 

Glu Ser Ser Glu Val Tyr Phe Leu Ala Thr Gin Tyr His Gly Arg 
230 235 240 

Lys Gly Thr Val Lys Gin 
245 

<210> 19 

<211> 483 

<212> PRT 

<213> Homo sapiens 



<220> 

<2 21> misc_f eature 

<223> Incyte ID No: 1655123CD1 

<400> 19 

Met Glu Glu Gly Gly Gly Gly Val Arg Ser Leu Val Pro Gly Gly 
15 10 15 

Pro Val Leu Leu Val Leu Cys Gly Leu Leu Glu Ala Ser Gly Gly 



15/93 



WO 01/07471 

20 25 30 

Gly Arg Ala Leu Pro Gin Leu Ser Asp Asp He Pro Phe Arg Val 

35 40 45 

Asn Trp Pro Gly Thr Glu Phe Ser Leu Pro Thr Thr Gly Val Leu 

50 r 55 6t0 

Tyr Lys Glu Asp Asn Tyr Val lie Wet Thr Thr Ala His„Lys Glu 

65 70 v 75 

Lys Tyr Lys Cys lie Leu Pro \ Leu Val Thr Ser Gly Asp Glu Glu 

80 85 9 0 

Glu Glu Lys Asp Tyr Lys- Gly Pro Asn Pro Arg Glu" Leu Leu Glu 

95 , ioo , i los 

Pro Leu Phe Lys Gin Ser Ser Cys Ser^Tyr,-Arg He Glu Ser' Tyr 

" .no v * 115 

Trp Thr Tyr Glu Val Cys His v Gly Lys His lie Arg Gin Tyr His 

? 125 130 135 

Glu Glu Lys Glu Thr Gly Gin Lys He Asn He His Glu Tyr Tvr 

14P 1 ' 145. * ^ 

Leu Gly Asn Met Leu Ala Lys Asn Leu Leu Phe Glu Lys Glu Arg 

155 i 160 i 165 

Glu Ala Glu Glu Lys Glu Lys - Ser Asn Glu He Pro Thr Xys Asn 

* 170 * l* 75 180 

He Glu Gly Gin Met Thr Pro Tyr Tyr Pro Val Gly Met Gly Asn 

185 190 195 

Gly Thr Pro Cys Ser Leu Lys Gin Asn Arg Pro Arg Ser Ser Thr 

200 205 210 

Val Met Tyr He Cys His Pro Glu Ser Lys His Glu He Leu Ser 

215 220 225 

Val Ala Glu Val Thr Thr Cys Glu Tyr Glu Val Val He Leu Thr 

230 235 240 

Pro Leu Leu Cys Ser His Pro Lys Tyr Arg Phe Arg Ala Ser Pro 
245 250 255 

Val Asn Asp He Phe Cys Gin Ser Leu Pro Gly Ser Pro Phe Lys 
260 265 270 

Pro Leu Thr Leu Arg Gin Leu Glu Gin Gin Glu Glu He Leu Arg 
275 280 285 

Val Pro Phe Arg Arg Asn Lys Glu Glu Asp Leu Gin Ser Thr Lvs 
290 295 300 

Glu Glu Arg Phe Pro Ala He His Lys Ser He Ala He Gly Ser 
305 310 * 315 

Gin Pro Val Leu Thr Val Gly Thr Thr His He Ser Lys Leu Thr 
320 325 330 

Asp Asp Gin Leu He Lys Glu Phe Leu Ser Gly Ser Tyr Cys Phe 
335 340 345 

Arg Gly Gly Val Gly Trp Trp Lys Tyr Glu Phe Cys Tyr Gly Lys 
350 355 36O 

His Val His Gin Tyr His Glu Asp Lys Asp Ser Gly Lys Thr Ser 
365 370 375 

Val Val Val Gly Thr Trp Asn Gin Glu Glu His He Glu Trp Ala 
380 385 3go 

Lys Lys Asn Thr Ala Arg Ala Tyr His Leu Gin Asp Asp Gly Thr 
395 400 405 

Gin Thr Val Arg Met Val Ser His Phe Tyr Gly Asn Gly Asp He 
410 415 420 

Cys Asp He Thr Asp Lys Pro Arg Gin Val Thr Val Lys Leu Lys 
425 430 435 

Cys Lys Glu Ser Asp Ser Pro His Ala Val Thr Val Tyr Met Leu 
440 445 450 

Glu Pro His Ser Cys Gin Tyr lie Leu Gly Val Glu Ser Pro Val 
455 460 465 

He Cys Lys He Leu Asp Thr Ala Asp Glu Asn Gly Leu Leu Ser 
470 475 4 8 o 

Leu Pro Asn 



PCT/US00/19948 
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<210> 20 
<211> 280 
<212> PRT, 

<213> Honio sapiens 
<22 0> 

<221> misc„f eature 

<223> Incyte ID No: 2553926CD1 

<400>: 20. 



Met 


Glu 


Ala 


Ala 


Glu 


Thr 


Glu 


Ala 


Glu 


Ala 


Ala 


Ala 


Leu 


Glu 


Val 


1 








5 










10 










15 


Leu 


Ala 'Glu 


Val 


Ala 


Gly He 


Leu 


Glu 


Pro 


Val 


Gly 


Leu 


Gin 


Glu 










20 










25 










30 


Glu 


Ala 


Glu 


Leu 


Pro .Ala 


Lys 


He 


Leu 


Val 


Glu 


Phe 


Val 


Val 


Asp 










35 










40 










45 


Ser 






Lys 


Asp 


Lys 


Leu 


Leu 


Cys 


Ser 
55 


Gin 


Leu 


Gin 


Val 


Ala 
60 


Asp 


Phe 


Leu 


Gin 


Asn 


He 


Leu 


Ala 


Gin 


Glu "Asp 


Thr 


Ala. 


Lys 


Gly 








U -J 










70 










75 


Leu 


Asp 


Pro 


Leu 


Ala 


Ser 


Glu 


Asp 


Thr 


Ser 


Arg 


Gin 


Lys 


Ala 


He 








ft n 

O V 










85 










90 


Ala 


Ala 


Lys 


Glu 


Gin 




Lys 


Glu 


Leu 


Lys 


Ala 


Thr 


Tyr 


Arg 


Glu 


















100 










105 


His 


Val 


Glu 


Ala 


He 

JL J. \J 


Lys 


lie 


Gly 


Leu 


Thr 
115 


Lys 


Ala 


Leu 


Thr 


Gin 
120 


Met 


Glu 


Glu 


Ala 


Gin 
125 


Arg 


Lys 


Arg 


Thr 


Gin 
130- 


Leu 


Arg 


Glu 


Ala 


Phe 
135 


Glu 


Gin 


Leu 


Gin 


Ala 
140 


Lys 


Lys 


Gin 


Met 


Ala 
145 


Met 


Glu 


Lys 


Arg 


Arg 

150 


Ala 


Val 


Gin 


Asn 


Gin 
155 


Trp 


Gin 


Leu 


Gin 


Gin 
160 


Glu 


Lys 


His 


Leu 


Gin 
165 


His 


Leu 


Ala 


Glu 


Val 
170 


Ser 


Ala 


Glu 


Val 


Arg 
175 


Glu 


Arg 


Lys 


Thr 


Gly 
180 


Thr 


Gin 


Gin 


Glu 


Leu 
185 


Asp 


Gly 


Val 


Phe 


Gin 
190 


Lys 


Leu 


Gly 


Asn 


Leu 
195 


Lys 


Gin 


Gin 


Ala 


Glu 


Gin 


Glu 


Arg 


Asp 


Lys 


Leu 


Gin 


Arg 


Tyr 


Gin 








200 










205 










210 


Thr 


Phe 


Leu 


Gin 


Leu 
215 


Leu 


Tyr 


Thr 


Leu 


Gin 
220 


Gly 


Lys 


Leu 


Leu 


Phe 
225 


Pro 


Glu 


Ala 


Glu 


Ala 
230 


Glu 


Ala 


Glu 


Asn 


Leu 
235 


Pro 


Asp 


Asp 


Lys 


Pro 
240 


Gin 


Gin 


Pro 


Thr 


Arg 
245 


Pro 


Gin 


Glu 


Gin 


Ser 
250 


Thr 


Gly 


Asp 


Thr 


Met 
255 


Gly Arg 


Asp 


Pro 


Gly 


Val 


Ser 


Phe 


Lys 


Phe 


Ser 


Lys 


Ala 


Val 


Gly 










260 










265 










270 


Leu 


Gin 


Pro 


Ala 


Gly 
275 


Asp 


Val 


Asn 


Leu 


Pro 
280 













<210> 21 
<211> 425 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2800717CD1 

<400> 21 

Met Gly Glu Asp Ala Ala Gin Ala Glu Lys Phe Gin His Pro Gly 
15 10 15 

Ser Asp Met Arg Gin Glu Lys Pro Ser Ser Pro Ser Pro Met Pro 
20 25 30 

Ser Ser Thr Pro Ser Pro Ser Leu Asn Leu Gly Asn Thr Glu Glu 
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35 40 45 

Ala He Arg Asp Asn Ser Gin Val Asn Ala Val Thr Val Leu Thr 

50 55 60 * 

Leu Leu Asp Lys Leu Val Asn Met Leu Asp Ala Val Gin Glu Asn 

- 65 TrO 75 T 

Gin His Lys Met Glu Gin Arg" Gin lie Ser Leu Glu Gly *'Ser Val * 

80 * 85 90 

Lys, Gly He Gin Asn Asp Leu Thr Lys Leu Ser Lys Tyr ,Gln Ala 

' 95 . 100 ■ 105 ( 

Ser Thr Ser Asn Thr Val Ser Lys Leu Leu Glu Lys Ser Arg Lys 

110 - ' 115 . 120 ^ 

Val Ser Ala His Thr Arg Ala. Val Lys Glu Arg Met Asp Arg Gin v ' 

125 , 130 135 - ' 

Cys Ala Gin Val Lys Arg Leu Glu Asn; Asn His Ala Gin Leu Leu 

140 - ^ 145 150 

Arg Arg Asn His Phe Lys Val Leu He Phe Gin Glu Glu Asn Glu 

• . 155 -: * 160 3 , ' 165 > 

He Pro Ala Ser Val Phe Val Lys Gin Pro Val Ser GlyAla : Val ' 

170 175 ' 180 

Glu Gly Lys Glu Glu Leu Pro Asp Glu Asn Lys Ser Leu Glu Glu 

185 190 195 

Thr Leu His Thr Val Asp Leu Ser Ser Asp Asp Asp Leu Pro His 

200 205 210 

Asp Glu Glu Ala Leu Glu Asp Ser Ala Glu Glu Lys Val Glu Glu 

215 220 225 

Ser Arg Ala Glu Lys He Lys Arg Ser Ser Leu Lys Lys Val Asp 

230 235 240 

Ser Leu Lys Lys Ala Phe Ser Arg Gin Asn He Glu Lys Lys Met 

245 250 255 

Asn Lys Leu Gly Thr Lys He Val Ser Val Glu Arg Arg Glu Lys 

260 265 270 

He Lys Lys Ser Leu Thr Ser Asn His Gin Lys He Ser Ser Gly 

275 280 285 

Lys Ser Ser Pro Phe Lys Val Ser Pro Leu Thr Phe Gly Arg Lys 

2 90 295 300 

Lys Val Arg Glu Gly Glu Ser His Ala Glu Asn Glu Thr Lys Ser 

305 310 315 

Glu Asp Leu Pro Ser Ser Glu Gin Met Pro Asn Asp Gin Glu Glu 

320 325 330 

Glu Ser Phe Ala Glu Gly His Ser Glu Ala Ser Leu Ala Ser Ala 

335 340 345 

Leu Val Glu Gly Glu He Ala Glu Glu Ala Ala Glu Lys Ala Thr 

350 355 360 

Ser Arg Gly Ser Asn Ser Gly Met Asp Ser Asn He Asp Leu Thr 

365 370 375 

He Val Glu Asp Glu Glu Glu Glu Ser Val Ala Leu Glu Gin Ala 

380 385 390 

Gin Lys Val Arg Tyr Glu Gly Ser Tyr Ala Leu Thr Ser Glu Glu 

395 400 405 

Ala Glu Arg Ser Asp Gly Asp Pro Val Gin Pro Ala Val Leu Gin 

410 415 420 

Val His Gin Thr Ser 

425 

<210> 22 

<211> 128 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5664154CD1 

<400> 22 
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Met Glu Ser 


Lys 


Glu 


Glu 


Arg 


Ala 


Leu 


Asn 


Asn 


Leu 


He 


Val 


Glu 


1 




5 










10 










15 


Asn Val Asn 


Gin 


Glu 


Asn 


Asp 


Glu 


Lys 


Asp 


Glu 


Lys 


.Glu 


Gin 


Val 






20 










25 










30 


Ala Asn Lys 


Gly 


Glu 


Pro 


Leu 


Ala 


Leu 


Pro 


Leu 


Asn 


Val 


Ser 


Glu 






. 35 










40 










45 


Tyr Cys Val 


Pro 


Arg 


Gly Asn 


Arg 


Arg 


Arg 


Phe 


Arg 


Val 


•Arg 


Gin 






;50 










55 










60 


Pro lie Leu 


.Gin 


Tyr 


Arg 


Trp 


Asp 


lie 


Met 


His 


Arg 


Leu 


Gly 


Glu 






65 










70 










75 


Pro Gin Ala 


Arg 


Met 


Arg 


Glu. 


Glu 


Asn 


Met 


Glu 


Arg 


Ile > 


Gly 


Glu 






80 




*Glu 






85 










90 


Olu Val Arg 


Gin 


Leu 


Met 


Lys 


Leu 


Arg 


Glu 


Lys 


Gin 


Leu 


Ser 




95 










100 










105 


His Ser Leu 


Arg 


Ala 


Val 


Ser 


Thr 


Asp 


Pro 


;Pro 


His 


His 


Asp. 


His 






110 










115 










120 


His Asp Glu 


Phe 


Cys . 


Leu 


Met 


Pro 




















125 






















<210> 23 


























<211> 113 


























<212> PRT 




















1 






<213> Homo sapiens 






















<220> 








i 


















<221> misc_f eature 






















<223> Incyte ID 


No: 


017900CD1 
















<400> 23 


























Met Asp Gly 


Arg 


Val 


Gin 


Leu 


He 


Lys 


Ala 


Leu 


Leu 


Ala 


Leu 


Pro 


1 




5 










10 










15 


lie Arg Pro 


Ala 


Thr 


Arg 


Arg 


Trp 


Arg 


Asn 


Pro 


He 


Pro 


Phe 


Pro 






20 










25 










30 


Glu Thr Phe 


Asp 


Gly 


Asp 


Thr 


Asp 


Arg 


Leu 


Pro 


Glu 


Phe 


He 


Val 






35 










40 










45 


Gin Thr Gly 


Ser 


Tyr 


Met 


Phe 


Val 


Asp 


Glu 


Asn 


Thr 


Phe 


Ser 


Ser 






50 










55 










60 


Asp Ala Leu 


Lys 


Val 


Thr 


Phe 


Leu 


He 


Thr 


Arg 


Leu 


Thr 


Gly 


Pro 






65 










70 










75 


Ala Leu Gin 


Trp 


Val 


He 


Pro 


Tyr 


He 


Lys 


Lys 


Glu 


Ser 


Pro 


Leu 






80 










85 










90 


Leu Asn Asp 


Tyr 


Arg 


Gly 


Phe 


Leu 


Ala 


Glu 


Met 


Lys 


Arg 


Val 


Phe 






95 










100 










105 


Gly Trp Glu 


Glu 


Asp 


Glu 


Asp 


Phe 




















110 






















<210> 24 


























<211> 308 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_ 


feature 






















<223> Incyte ID 


No: 


035102CD1 
















<400> 24 


























Met Leu Gin 


Thr 


Pro 


Glu 


Ser 


Arg 


Gly 


Leu 


Pro 


Val 


Pro 


Gin 


Ala 


1 




5 










10 










15 


Glu Gly Glu 


Lys 


Asp 


Gly 


Gly 


His 


Asp 


Gly 


Glu 


Thr 


Arg 


Ala 


Pro 






20 










25 










30 


Thr Ala Ser 


Gin 


Glu 


Arg 


Pro 


Lys 


Glu 


Glu 


Leu 


Gly 


Ala 


Gly 


Arg 






35 










40 










45 


Glu Glu Gly 


Ala 


Ala 


Glu 


Pro 


Ala 


Leu 


Thr 


Arg 


Lys 


Gly 


Ala 


Arg 






50 










55 










60 


Ala Leu Ala 


Ala 


Lys 


Ser 


Leu 


Ala 


Arg 


Arg 


Arg 


Ala 


Tyr 


Arg 


Arg 
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65 








Leu 


Asn 


Arg 


Thr 


Val 


Ala 


Glu 


Leu 










80 








Asp 


Lys 


Lys 


* Lys 


Ser. 


Pro 


He 


Thr 


* 


t 






95 








Val 


He 


Gly 


Asp 


Leu 


Lys 


He 


Leu 










aio 








Ala 


Ala 


Glu 


His 


Leu* 


Arg 


Tyr 


Val' 










125? 








Phe 


Asp 


Arg 


Lys 


His 


His 


Thr 


Tyr 










140 








Pro 


Leu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 










155 








Pro 


*Arg 


Leu 


Gly 


Leu 


Leu 


Met 


Met 










il70 








Arg 


Gly 


Asn 


Ser 


Ala 


Arg 


Glu 


Ala 




* 






185 








Arg 


Leu 


Gly Val 


Gin 


Pro 


Ser 


Lvs 










200 








Pro 


Lys 


Arg 


Leu 


He 


Met 


Glu 


Asp 










215 








Ser 


Tyr 


Arg 


Arg 


Val 


Pro 


His 


Thr 










230 








Ser 


Trp Gly 


Pro 


Arg 


Ser 


Asn 


Leu 










245 








Leu 


Gly 


Phe 


Val 


Ala 


Lvs 


Leu 


ftis 










260 








Pro 


Val 


Gin 


Tyr 


Arg 


Glu 


Ala 


Leu 










275 








Arg 


Ala 


Lys 


Ala 


Arg 


Ala 


Glu 


Ala 










290 








Ala 


Arg 


Ala 


Gly 


He 


His 


Leu 


Trp 










305 









<210> 25 

<211> 221 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 259983CD1 

<400> 25 



Met 


Phe 


Gly 


Phe 


His 


Lys 


Pro 


Lys 


1 








5 








Cys 


Cys 


He 


Cys 


Arg 
20 


Ala 


Lys 


Ser 


Ser 


Lys 


Arg 


Tyr 


Glu 
35 


Lys 


Asp 


Phe 


Glu 


Thr 


Arg 


Ser 


Gly 
50 


Asp 


He 


Cys 


Lys 


Arg 


Trp 


Lys 


Lys 
65 


Leu 


Pro 


Ala 


His 


Val 


Val 


Asp 


Ala 
80 


Arg 


Ala 


Gly 


Lys 


Pro 


Lys 


Lys 


Val 
95 


Lys 


Thr 


Leu 


Asn 


Gin 


He 


Ser 


Lys 
110 


Leu 


Gin 


Lys 


Asp 


Ala 


His 


Ser 


Thr 
125 


Thr 


Ser 


Ser 


Cys 


Tyr 


Ser 


Asn 


Gin 


Ser 


Asp 


Asp 



140 





70 










75 


Val 


Gin 


Phe 


Leu 


Leu 


Val 


Lys 




, 85 










90 


.Arg 


Ser 


Glu 


Met 


Val 


Lys 


* Tvr 




100 










105 


Phe 


Pro 


Asp 


lie 


lie 


Ala 


Arg 




115 










120 


Phe 


Gly 


Phe 


<Glu 


Leu 


Lys 


Gin 




130 










135 


He 


Le^ : 


lie 


Asn 


Lys 


Leu 


Lys 




145 


Leu 




i 




'150 


Glu Asp 


Gly Gly Asp Gly 




160 










165. 


He 


Leu 


Gly 


Leu 


lie 


Tyr 


Met 




175 








180 


Gin 


Val 


Trp 


Glu 


Met 


Leu 


Arg 




190 










195 


Tyr 


His 


Phe 


Leu 


Phe 


Gly Tyr 




205 










210 


Phe 


Val 


Gin 


Gin 


Arg 


Tyr 


Leu 




220 










225 


Asn 


Pro 


Pro 


Ala 


Tyr 


Glu 


Phe 




235 










240 


Glu 


He 


Ser 


Lys 


Met 


Glu 


Val 




250 










255 


Lys 


Lys 


Glu 


Pro 


Gin 


His 


Trp 




265 










270 


Ala 


Asp 


Glu 


Ala 


Asp 


Arg 


Ala 




280 










285 


Ser 


Met 


Arg 


Ala 


Arg 


Ala 


Ser 




295 










300 



Met 


Tyr 


Arg 


Ser 


He 


Glu 


Gly 




10 










15 


Ser 


Ser 


Ser 


Arg 


Phe 


Thr 


Asp 




25 










30 


Gin 


Ser 


Cys 


Phe 


Gly 


Leu 


His 




40 










45 


Asn 


Ala 


Cys 


Val 


Leu 


Leu 


Val 




55 










60 


Gly 


Ser 


Lys 


Lys 


Asn 


Trp 


Asn 




70 










75 


Pro 


Ser 


Leu 


Lys 


Thr 


Thr 


Leu 




85 










90 


Ser 


Gly Asn 


Arg 


He 


Lys 


Ser 




100 










105 


Glu 


Phe 


Lys 


Arg 


His 


Asn 


Ser 




115 










120 


Ala 


Ser 


Pro 


Ala 


Gin 


Ser 


Pro 




130 










135 


Gly 


Ser 


Asp 


Thr 


Glu 


Met 


Ala 




145 










150 
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ser Gly Ser Asn Arg Thr Pro Val Phe Ser Phe Leu Asp Leu Thr 

155 160 165 

Tvr Trp Lys Arg Gin Lys lie Cys Cys Gly lie He Tyr Lys Gly 

170 175 180 

Ara Phe Gly Glu Val Leu He Asp Thr His Leu Phe Lys Pro Cys 

185 190 ; 195 r - 

Cvs Ser Asn Lys. Lys Ala Ala Ala Glu Lys Pro Glu Glu Gin Gly 
, .200 205 210 

Pro Glu Pro Leu Pro He Ser Thr Gin Glu Trp 

215 220 
<210> 26 ; . 

<211> 402 

<212> PRT ' 
<213> Homo sapiens , 

<220> • 
<221> misc^f eature 

<223> Incyte ID No: 926810CD1 ; * 



<400> 26 










Aig 
















Ash 


Met 


Ala 


Ser 


He 


He 


Ala 


Val 


Gly Asn 


Ser 


Arg 


Arg 


Leu 


1 








5 










10 










15 


Ala 


Pro 


Leu 


Pro 


Pro 


Trp 


Ala 


His 


Ser 


Met 


Leu 


Arg 


Ser 


Leu 


Gly 








20 










25 










30 


Arg 


Ser 


Leu 


Gly 


Pro 


He 


Met 


Ala 


Ser 


Met 


Ala 


Asp 


Arg 


Asn 


Met 






35 










40 










45 


Lys 


Leu 


Phe 


Ser 


Gly 
50 


Arg 


Val 


Val 


Pro 


Ala 
55 


Gin 


Gly 


Glu 


Glu 


Thr 
60 


Phe 


Glu 


Asn 


Trp 


Leu 


Thr 


Gin 


Val 


Asn 


Gly 


Val 


Leu 


Pro 


Asp 


Trp 






65 










70 










75 


Asn 


Met 


Ser 


Glu 


Glu 
80 


Glu 


Lys 


Leu 


Lys 


Arg 
85 


Leu 


Met 


Lys 


Thr 


Leu 
90 


Arg 


Gly 


Pro 


Ala 


Arg 


Glu 


Val 


Met 


Arg 


Val 


Leu 


Gin 


Ala 


Thr 


Asn 






95 










100 










105 


Pro 


Asn 


Leu 


Ser 


Val 


Ala 


Asp 


Phe 


Leu 


Arg 


Ala 


Met 


Lys 


Leu 


Val 








110 










115 










120 


Phe 


Gly 


Glu 


Ser 


Glu 


Ser 


Ser 


Val 


Thr 


Ala 


His 


Gly 


Lys 


Pne 


fne 








125 










130 








Val 


135 


Asn 


Thr 


Leu 


Gin 


Ala 


Gin 


Gly 


Glu 


Lys 


Ala 


Ser 


Leu 


Tyr 


He 








140 










145 










150 


Arg 


Leu 


Glu 


Val 


Gin 


Leu 


Gin 


Asn 


Ala 


He 


Gin 


Ala 


Gly 


He 


lie 








155 










160 










165 


Ala 


Glu 


Lys 


Asp 


Ala 


Asn 


Arg 


Thr 


Arg 


Leu 


Gin 


Gin 


Leu 


Leu 


Leu 






170 










175 










180 


Gly 


Gly 


Glu 


Leu 


Ser 


Arg 


Asp 


Leu 


Arg 


Leu 


Arg 


Leu 


Lys 


Asp 


Phe 






185 










190 










195 


Leu 


Arg 


Met 


Tyr 


Ala 


Asn 


Glu 


Gin 


Glu 


Arg 


Leu 


Pro 


Asn 


Phe 


Leu 






200 










205 










210 


Glu 


Leu 


He 


Arg 


Met 


Val 


Arg 


Glu 


Glu 


Glu 


Asp 


Trp 


Asp 


Asp 


Ala 








215 










220 










225 


Phe 


He 


Lys 


Arg 


Lys 


Arg 


Pro 


Lys 


Arg 


Ser 


Glu 


Ser 


Met 


Val 


Glu 








230 










235 










240 


Arg 


Ala 


Val 


Ser 


Pro 


Val 


Ala 


Phe 


Gin 


Gly 


Ser 


Pro 


Pro 


He 


Val 








245 










250 










255 


He 


Gly 


Ser 


Ala 


Asp 


Cys 


Asn 


Val 


He 


Glu 


He 


Asp 


Asp 


Thr 


Leu 








260 










265 










270 


Asp 


Asp 


Ser 


Asp 


Glu 


Asp 


Val 


He 


Leu 


Val 


Glu 


Ser 


Gin 


Asp 


Pro 






275 










280 










285 


Pro 


Leu 


Pro 


Ser 


Trp 
290 


Gly 


Ala 


Pro 


Pro 


Leu 
295 


Arg 


Asp 


Arg 


Ala 


Arg 
300 


Pro 


Gin 


Asp 


Glu 


Val 


Leu 


Val 


He 


Asp 


Ser 


Pro 


His 


Asn 


Ser 


Arg 








305 










310 










315 


Ala 


Gin 


Phe 


Pro 


Ser 


Thr 


Ser 


Gly 


Gly 


Ser 


Gly 


Tyr 


Lys 


Asn 


Asn 
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320 325 . 33 0 

Gly Pro Gly Glu Met Arg Arg Ala Arg Lys Arg Lys His Thr He 

335 340 * 345 

Arg Cys Ser Tyr Cys Gly Glu Glu Gly His Ser Lys Glu Thr Cvs 

350 ' 355 9" - , 36Q 

Asp Asn Glu Ser Asp. Ly^s Ala Gin Val Phe Glu Asn' Leu He He 

365 * , 370 , 375 

Thr Leu Gin Glu Leu Thr His Thr Glu Met Glu Arg Ser Arg Val 
^ 380 385 3Q 0 

Ala Pro Gly Glu Ty K Asn Asp Phe Ser Glu Pro Leu *■ T 

395 . * >' 400 • 

<211> 93 ' ! . f " 

<212> PRT ^ : - , 

<213> Homo sapiens . 1 v 

<220> * , ^ ' . 1 , : 

<221> misc_feature ■ / ' ' * ; * , * ' 

<223> Incyte ID No: 1^398816001 

i ' 
<400> 27 " ; 

Met Ser Thr Asp Thr Gly Val Ser Leu Pro Ser Tyr,' Glu Glu Asp 

5 1 0 * 

Gin Gly ser Lys Leu He Arg Lys Ala Lys Glu Ala Pro Phe Val 

Pro Val Gly He Ala Gly Phe Ala Ala lie Val Ala Tyr Gly Leu 

Tyr Lys Leu Lys Ser Arg Gly Asn Thr Lys Met Ser He His Leu 

He His Met Arg Val Ala Ala Gin Gly Phe Val Val Gly Ala till 

Thr Val Gly Met Gly Tyr Ser Met Tyr Arg Glu Phe Trp Ala Lys 

Pro Lys Pro 85 90 



PCT/US00/19948 



<210> 28 

<211> 353 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1496820CD1 

<400> 28 

Met Asn Arg Glu Asp Arg Asn Val Leu Arg Met Lys Glu Arg Glu 

5 10 is 

Arg Arg Asn Gin Glu He Gin Gin Gly Glu Asp Ala Phe Pro Pro 

Ser Ser Pro Leu Phe Ala Glu Pro Tyr Lys Val Thr Ser Lys Glu 
35 40 *c 

Asp Lys Leu Ser Ser Arg lie Gln S er Met Leu Gly Asn Tyr Asp 

Glu Met Lys Asp Phe He Gly Asp Arg Ser lie Pro Lys Leu vll 

Ala He Pro Lys Pro Thr Val Pro Pro sir Ala Asp Glu Lys Ser 
80 85 a 0 

Asn Pro Asn Phe Phe Glu Gin Arg His Gly Gly Ser His Gin Ser 
95 100 10 5 

Ser Lys Trp Thr Pro Val Gly Pro Ala Pro Ser Thr Ser Gin Ser 

Gin Lys Arg Ser Ser Gly Leu Gin Ser Gly His Ser Ser Gin Arg 
125 130 135 
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Ser 


Ala 


Gly 


Ser 
140 


Ser 


Ser 


Gly 




Asp Arg Glu 


Ser 


Tyr 


Asn 


Asn 










155 








Lys 


Gly 


Gin 


His 


Gly 


Ser 


Glu 


His 








170 








Pro 


Gly 


Liys 


Pro 


Gin Ala 


Val 


Ser 










185 








Arg 


Ser 


His 


Gly 


Asn 


Asp 


His 


His 








200 








Lys 


Ser 


Pro 


Arg 


Asp 


Pro 


Asp 


Ala 








215 








vai 


Pro 


Phe 


Ser 


Ser 
230 


Gly 


Gin 


His 


Ser 


Leu 


Met 


Ser 


Lys 
245 


Ser 


Asn 


Ser 


Tyr- 


Val 


Arg 


Pro 


Met 


Asp 


Gly 


Gin 








260 








Ser 


Ser 


Glu 


His 


Tyr 
275 


Ser 


Ser 


Gin 


Glu 


Leu 


Lys 


Pro 


Ser 
290 


Ser 


Lvs 


Ala 


Pro 


Ser 


Gin 


Pro 


Leu 
305 


Asp 


Ala 


Ser 


Val 


Asp 


Glu 


He 


Leu 
320 


Lys 


Glu 


Met 


Leu 


Thr 


Ala 


He 


His 
335 


Thr 


Pro 


Cys 


Pro 


Phe 


Pro 


Thr 


Lys 
350 


Val 


Ser 


Lys 


<210> 29 













<211> 120 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1514559CD1 

<400> 29 

Met Ser Glu Pro Ala Gly Asp Val 

1 5 < 

Lys Ala Cys Arg Arg Leu Phe Gly 
20 

Ser Arg Asp Cys Asp Ala Leu Met 
35 

Arg Glu Arg Trp Asn Phe Asp Phe 
50 

Gly Asp Phe Ala Trp Glu Arg Val 
65 

Leu Tyr Leu Pro Thr Trp Ser Ala 
80 

Cys Gly Ser Phe Pro Ser Leu Ser 
95 

Pro Phe Pro Gly His Ser Asp Leu 
110 

<210> 30 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<220> 



Thr 


Asn 


Ser 


Ser 


Gly Gin 


Arg 




145 










150 


Ser Gly 


Ser 


Ser 


Ser 


Arg 


Lys 




160 










165 


Ser 


Lys 


Ser^ Arg 


Ser 


Ser 


Ser 




175 










180 


Ser 


Leu 


Asn 


Ser 


Ser 


His 


Ser 




190' 








- 


195 


Ser 


Lys 


Glu 


His 


Gin 


Arg 


Ser 




205 










210 


Asn 


Trp 


Asp 


Ser 


Pro 


Ser 


Arg 




220 










225 


Ser 


Thr 


Gin 


Ser 


Phe 


l Pro 


Pro , 




235 










240- 


Met 


Leu Gin 


Lys 


Pro 


Thr 


Ala 




.250 










255 . 


Glu 


Ser 


Met 


Glu 


Pro 


Lys 


Leu 




2 65 






i 




270 


Ser 


His' 


Gly 


Asn. 


Ser 


Met 


Thr 




280 










285 


His 


Leu 


Thr 


'Lys 


Leu 


Lys 


He 




295 










300 


Ala 


Ser 


Gly 


Asp 


Val 


Ser 


Cys 




310 










315 


Thr 


His 


Ser 


Trp 


Pro 


Pro 


Pro 




325 










330 


Lys, 


Thr 


Glu 


Pro 


Ser 


Lys 


Phe 


340 










345 



ft 



Arg 


Gin 


Asn 


Pro 


Cys 


Gly 


Ser 


10 










15 


Pro 


Val 


ASP 


Ser 


Glu 


Gin 


Leu 




25 










30 


Ala 


Gly 


Cys 


He 


Gin 


Glu 


Ala 




40 










45 


Val 


Thr 


Glu 


Thr 


Pro 


Leu 


Glu 




55 










60 


Arg 


Gly 


Leu 


Gly 


Leu 


Pro 


Lys 


70 










75 


Gly 


Trp 


Tyr 


Pro 


Leu 


Glu 


Gly 


85 










90 


Gin 


Ala 


Val 


Met 


Lys 


Phe 


Thr 




100 










105 


Asn 


Ser 


Phe 


Ser 


Phe 


Glu 


Lys 




115 










120 
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<221> misc__ feature 
<223> Incyte ID No: 1620092CD1 
* 

<400> 30 

Met Arg Ser Cys. Phe Arg Leu Cys Glu- Arg Asp Val Ser Ser Ser 

Leu Arg Leu Thr Arg.. Ser Ser Asp : Leu -Lys Arg. lie Asn Gly Phi 
20 ' ' 25 .- ' ' . 30 

Glu.Ser Pro Gly Ala Pro Ser Arg Thr Tyr 
35 40 x 4-5 



PCT/USOO/19948 



Cys Thr Lys Pro din •■ Glu.Ser Pro Gly Ala Pro Ser Arg Thr Tyr 

Asn Arg Val Pro' Leu -His Lys Pro Thr' Asp^Trp Gin Lys Lys Xll 

55 * ' £n 

Leu lie Trp Ser Gly Arg Phe. Lys Lys Glu Asp Glu He. Pro Glu 

Thr Val 'Ser Leu Glu Met Leu Asp Ala Ala .Lys Asn Lys Met Arg 
80 85 ' . 90 

Cys 

He Phe Met Val lie Glu Gly Lys Lys Al* Ala Gin Arg His gJu 



Val Lys He Ser Tyr Leu Met 'life Ala Leu Thr Val Val Gly cys 



100 

_ _ . Alk 

Thr Leu Thr Ser Leu Asn Leu Glu Lys Lys Ala Arg Leu Lys Glu 

12 5 130 _ _ _ 

Glu Ala Ala. Met Lys Ala Lys Thr Glu 
140 



<210> 31 
<211> 933 
<212> PRT 

<213> Homo sapiens - 
<220> 

<221> misc.feature 

<223> Incyte ID No: 1678765CD1 

<400> 31 

Met Phe Tyr Leu Glu Asp Asp Lys Glu Asp Glu Val Val Cys Lys 

10 

Gly Ser Leu Ser Lys Thr Gin Asp Val Tyr His Asp Lys Ser Pr"o 

Pro Gly He Leu Ser Gin Thr Met Asn Tyl Val Gly Gin Leu Ala 

Gly Gin Val He Val Thr Val Lys Glu LeS Tyr Lys Gly He Asn 

55 - n 
Gin Ala Thr Leu Ser Gly Cys He Asp Val He Val Val Gin Gin 

70 7 c 
Gin Asp Gly Ser Tyr Gin Cys Ser Pro Phe His Val Arg Phe Gly 

Lys Leu Gly Val Leu Arg Ser Lys Glu Lys Val He Asp He Glu 

105 



9 5 100 



He Asn Gly Ser Ala Val Asp Leu His Met Lys Leu Gly Asp Asn 



115 



Gly Glu Ala Phe Phe Val Glu Glu Thr G^u Glu Glu Tyr Glu tyl 



125 



Leu Pro Ala Tyr Leu Ala Thr Ser Pro He Pro Thr Glu Asp G^n 



140 



145 



Phe Phe Lys Asp He Asp Thr Pro Leu Val Lys Ser Gly Gly isp 



155 



160 



Glu Thr Pro Ser Gin Ser Ser Asp He Ser His Val Leu Glu Sr 



170 



175 



Glu Thr lie Phe Thr Pro Ser Ser Val Lys Lys Lys Lys Arg Arg 



185 



190 



Arg Lys Lys Tyr Lys Gin Asp Ser Lys Lys Glu Glu Gin Ala HI 



200 



205 



Ser Ala Ala Ala Glu Asp Thr Cys Asp Val Gly Val Ser Ser Asp 
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215 ^ 220 225. 

Asp Asp Lys Gly Ala Gin Ala Ala Arg Gly Ser Ser Asn Ala Ser 
230 235 240 

Leu Lys Glu Glu Glu Cys Lys Glu Pro Leu Leu Phe His Ser Gly 
245 250 * 255 

Asp His' Tyr Pro Leu Ser Asp Gly Asp Trp Ser Pro Leu Glu Thr 
260 " 265 270 

Thr Tvr Pro Gin Thr .Ala Cys Pro Lys Ser Asp Ser Glu Leu Glu 
275 280 285 

Val Lvs Pro Ala Glu Ser Leu Leu Arg Ser Glu Tyr His Met Glu 

290 *■ ' 295 300 : 

Trp Thr Trp Gly Gly Phe' Pro Glu Ser Thr Lys Val Ser Lys Arg . _ 

305 310 ; 315 

Glu Arg Ser Asp His His Pro Arg Thr Ala ,Thr He Thr Pro Ser 
320 325 330 

Glu Asn Thr His Phe Arg Val He Pro Ser Glu Asp Asn Leu lie, 
335 ' 340 - - 345 

Ser Glu Val Glu Lys Asp Ala Ser Met Glu Asp Thr Val Cys Thr 
350 355 360 

He Val Lys Pro Lys Pro Arg Ala Leu Gly Thr Gin Met Ser Asp 
365 370 C 375 

Pro Thr Ser Val Ala Glu Leu Leu Glu Pro Pro Leu Glu Ser Thr 4 

380 385 390 

Gin He Ser Ser Met Leu Asp Ala Asp His Leu Pro Asn Ala Ala 
395 400 405 

Leu Ala Glu Ala Pro Ser Glu Ser Lys Pro Ala Ala Lys Val Asp 
410 415 420 

Ser Pro Ser Lys Lys Lys Gly Val His Lys Arg He Gin His Gin 
425 430 435 

Gly Pro Asp Asp He Tyr Leu Asp Asp Leu Lys Gly Leu Glu Pro 
440 445 450 

Glu Val Ala Ala Leu Tyr Phe Pro Lys Ser Glu Ser Glu Pro Gly 
455 460 465 

Ser Arg Gin Trp Pro Glu Ser Asp Thr Leu Ser Gly Ser Gin Ser 
470 475 480 

Pro Gin Ser Val Gly Ser Ala Ala Ala Asp Ser Gly Thr Glu Cys 
485 490 495 

Leu Ser Asp Ser Ala Met Asp Leu Pro Asp Val Thr Leu Ser Leu 
500 505 510 

Cys Gly Gly Leu Ser Glu Asn Gly Lys He Ser Lys Glu Lys Phe 
515 520 - 525 

Met Glu His He He Thr Tyr His Glu Phe Ala Glu Asn Pro Gly 
530 535 540 

Leu He Asp Asn Pro Asn Leu Val He Arg He Tyr Asn Arg Tyr 
545 550 555 

Tvr Asn Trp Ala Leu Ala Ala Pro Met He Leu Ser Leu Gin Val 
560 565 570 

Phe Gin Lys Ser Leu Pro Lys Ala Thr Val Glu Ser Trp Val Lys 
575 580 585 

Asp Lys Met Pro Lys Lys Ser Gly Arg Trp Trp Phe Trp Arg Lys 
590 595 600 

Arg Glu Ser Met Thr Lys Gin Leu Pro Glu Ser Lys Glu Gly Lys 
605 610 615 

Ser Glu Ala Pro Pro Ala Ser Asp Leu Pro Ser Ser Ser Lys Glu 
620 625 630 

Pro Ala Gly Ala Arg Pro Ala Glu Asn Asp Ser Ser Ser Asp Glu 
635 640 645 

Gly Ser Gin Glu Leu Glu Glu Ser He Thr Val Asp Pro He Pro 
650 655 660 

Thr Glu Pro Leu Ser His Gly Ser Thr Thr Ser Tyr Lys Lys Ser 
665 670 675 

Leu Arg Leu Ser Ser Asp Gin He Ala Lys Leu Lys Leu His Asp 
680 685 690 
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Gly Pro Asn Asp Val Val Phe Ser He Thr Thr Gin /Tyr Gin Gly 
695 700 nnZ 

Thr Cys Arg Cys Ala Gly Thr He Tyr Leu Trp Asn Trp Asn Asp 

Lys He ile lie Ser Asp He Asp -Gly He Thr Lys Ser Isp ' 

■ 725 * 730 

Ala Leu Gly Gin lie Leu Pro Gin Leu Gly Lys Asp Trp Thr His " " 

740 , ' 745 ■ nt^n 

Gin Gly He Ala Lys Leu Tyr His Ser lie Asn Glu ,Asn Gly Tyr ' ' 

755 ' 760 ncc 

; xys Phe Leu Tyr Cys Ser Ala Arg Ala He Gly Me t Ala Asp Mel , 

.- 775" ' 

Thr Arg. -'Gly Tyr Leu His Trp Val- Asn Asp Lys Gly Thr Ile'Zeu 

. Pro Arg ,Gly Pro Leu Met Leu Ser ;Pro Ser iS er Leu Phe W Ala ' 

8 00 " * 8 0 5 

• Phe His Arg Glu Val He* Glu Lys Lys Pro Glu Lys'' Phe Lys He 

- ' 815 f 820 ' ' 1 otc 

Glu Cys Leu Asn Asp He Lys Asn Leu Phe Ala Pro Ser' Lys Gin V 

83 0 835 o 

Pro Phe Tyr Ala Ala Phe Gly Asn Arg Pro Asn : A sp Val Tyr a" C ' . 

^ , 845 850 - ore 

Tyr Thr Gin Val Gly Val Pro Asp Cys Arg He' Phe Thr Val Isn 

8 6 0 865 

Pro Lys Gly Glu Leu He Gin Glu Arg Thr Lys Gly Asn Lys Ser 

85 880 oqc 

Ser Tyr His Arg Leu Ser Glu Leu Val Glu His Val Phe Pro Leu 

890 895 

Leu Ser Lys Glu Gin Asn Ser Ala Phe Pro Cys Pro Glu Phe III 



910 



Ser Phe Cys Tyr Trp Arg Asp Pro He Pro Glu Val Asp Leu Asp 

925 QTn 
Asp Leu Ser 9:30 



<210> 32 
<211> 268 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc.feature 

<223> Incyte ID No: 1708229CD1 

<400> 32 

Met Leu Gly Asp His Cys Ser Leu Pro Glu Asp Gin Ala Arg Pro 



Gly Gin Ser Leu Gin Ser Gly Leu Cys Cys Lys Met Val Leu Gin 
Ala Val Ser Lys Val Leu Arg Lys Ser Lys Ala Lys Pro Asn G ly 



Lys Lys Pro Ala Ala Glu Glu Arg Lys A Ia Tyr Leu Glu Pro G Ju 



50 



55 



His Thr Lys Ala Arg He Thr Asp Phe G^n Phe Lys Glu Leu vll 

Val Leu Pro Arg Glu He Asp Leu Asn olu Trp Leu Ala Ser Asn 

85 

Thr Thr Thr Phe Phe His His lie Asn Leu Gin Tyr Ser Thr He 

Ser Glu Phe Cys Thr Gly Glu Thr Cys all Thr Met Ala Val Jys 
110 115 

Trp Tyr Asp Glu Arg Gly Lys Lys Val 

Cys Thr Ala Pro Sin Tyr Val Asp Phe III Met Ser Ser Val Gin 



Asn Thr Gin Tyr Tyr Trp Tyr Asp Glu Arg Gly Lys Lys Val lyl 
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140 










145 








150 


LVS 

J ~' Jr *-' 


Leu Val 


Thr 


Asp 


Glu 


Asp 


Val 


Phe 


Pro 


Thr 


Lys 


Tyr Gly Arg 






155 










160 








165 


Glu 


Phe Pro 


Ser 


Ser 


Phe 


Glu 


Ser 


Leu 


Val 


Arg 


Lys 


He Cys 


Arg 






,170 










175 








180 


W"i c 
nxo 




His 


Val 


Leu 


Ala 


His 


lie 


Tyr 


Trp 


Ala 


His Phe 


Lys 






185" 










190 








195 


Glu 


Thr Leu 


Ala 


Leu 


Glu 


Leu 


His 


Gly 


His 


Leu 


Asn 


Thr Leu 


Tyr 






200 










205 








210 


Val 


His Phe 


lie 


Leu 


Phe 


Ala 


Arg 


Glu 


Phe 


Asn 


Leu 


Leu Asp 


Pro 






215 










220 








225 


Lys 


Glu Thr 


Ala 


,-Ile 


Met 


Asp 


Asp 


Leu 


Thr <51u 


Val 


Leu Cys 


Ser 






230 










235 








240 


Gly 


Ala Gly Gly 


Val, 


His 


Ser 


Gly 


Gly 


Ser 


Gly Asp 


Gly Ala 


Gly 






245 










250 








255 


Ser 


Gly Gly 


Pro 


Giy 


Ala 


Gin 


Asn 


His 


Val 


Lys 


Glu 


Arg 








260 










265. 











<210> 33' 
<211> 337 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> incyte ID No: 1806454CD1 



<400> 33 
























Val 




Met 


Leu 


Leu 


Gly 


Leu 


Ala 


Ala 


Met 


Glu 


Leu 


Lys 


Val 


Trp 


Asp 


1 






5 










10 










lb 


Gly 


He 


Gin 


Arg 


Val 


Val 


Cys 


Gly 


Val 


Ser 


Glu 


Gin 


Thr 


Thr 


Cys 








20 










25 










30 


Gin 


Glu 


Val 


Val 


He 


Ala 


Leu 


Ala 


Gin 


Ala 


He 


Gly 


Gin 


Thr 


Gly 








35 










40 










45 


Arg 


Phe 


Val 


Leu 


Val 


Gin 


Arg 


Leu 


Arg 


Glu 


Lys 


Glu 


Arg 


Gin 


Leu 








50 










55 










60 


Leu 


Pro 


Gin 


Glu 


Cys 


Pro 


Val 


Gly 


Ala 


Gin 


Ala 


Thr 


Cys 


Gly 


Gin 








65 










70 










75 


Phe 


Ala 


Ser 


Asp 


Val 


Gin 


Phe 


Val 


Leu 


Arg 


Arg 


Thr 


Gly 


Pro 


Ser 








80 










85 










90 


Leu 


Ala 


Gly 


Arg 


Pro 


Ser 


Ser 


Asp 


Ser 


Cys 


Pro 


Pro 


Pro 


Glu 


Arg 






95 










100 










105 


Cys 


Leu 


He 


Arg Ala 


Ser 


Leu 


Pro 


Val 


Lys 


Pro 


Arg 


Ala 


Ala 


Leu 








110 










115 










120 


Gly 


Cys 


Glu 


Pro 


Arg 


Lys 


Thr 


Leu 


Thr 


Pro 


Glu 


Pro 


Ala 


Pro 


Ser 






125 










130 










135 


Leu 


Ser 


Arg 


Pro 


Gly 


Pro 


Ala 


Ala 


Pro 


Val 


Thr 


Pro 


Thr 


Pro 


Gly 








140 










145 










150 


Cys 


Cys 


Thr 


Asp 


Leu 


Arg 


Gly 


Leu 


Glu 


Leu 


Arg 


Val 


Gin 


Arg 


Asn 






155 










160 










165 


Ala 


Glu 


Glu 


Leu 


Gly 


His 


Glu 


Ala 


Phe 


Trp 


Glu 


Gin 


Glu 


Leu 


Arg 








170 










175 










180 


Arg 


Glu 


Gin 


Ala 


Arg 


Glu 


Arg 


Glu 


Gly 


Gin 


Ala 


Arg 


Leu 


Gin 


Ala 








185 










190 










195 


Leu 


Ser 


Ala 


Ala 


Thr 
200 


Ala 


Glu 


His 


Ala 


Ala 

205 


Arg 


Leu 


Gin 


Ala 


Leu 
210 


Asp 


Ala 


Gin 


Ala 


Arg 


Ala 


Leu 


Glu 


Ala 


Glu 


Leu 


Gin 


Leu 


Ala 


Ala 








215 










220 










225 


GlU 


Ala 


Pro 


Gly 


Pro 


Pro 


Ser 


Pro 


Met 


Ala 


Ser 


Ala 


Thr 


Glu 


Arg 






230 










235 










240 


Leu 


His 


Gin 


Asp 


Leu 


Ala 


Val 


Gin 


Glu 


Arg 


Gin 


Ser 


Ala 


Glu 


Val 








245 










250 










255 


Gin 


Gly 


Ser 


Leu 


Ala 


Leu 


Val 


Ser 


Arg 


Ala 


Leu 


Glu 


Ala 


Ala 


Glu 








260 










265 










270 
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- Arg Ala Leu Gin Ala Gin Ala Gin Glu Leu Glu Glu Leu Asn Arg 

275 280 " 285 

Glu Leu Arg Gin Cys Asn Leu pin Gin Phe lie Gin Gin Thr Glv 
290 295 ' 300 

Ala <Ala-'Leu Pro Pro Pro Pro Arg Pro.. Asp Arg Gly Pro Pro Gly 

305 310 315 

Thr Gin Val Gly Val Val teu Gly Gly Gly Trp Glu Val Arg Thr'' 

320 325 33Q 

Trp Pro per Pro Thr Pro' Ser 

335 x ■ 

<210> 34 ■ . .•'.>: 

<211> 565 ' . . 

<212> PRT 1 ■ , . 

<213> Homo sapiens .' • 

<220> : 

,<22'l> misc_feature / ' , i . ' - i ■ 

<223> Incyte ID, No: 1806850CD1 '" " ■ 

« " / ' '' - 

<400> 34 * 

Met Lys Glu Glu Glu Glu Val Phe Gin Pro Met Leu Met Glu Tyr 

5 1° 15 

Phe Thr Tyr Glu Glu Leu Lys Tyr He Lys Lys Lys Val He Ala 
„ . 20 2S 30 

Gin His Cys Ser Gin Lys Asp Thr Ala Glu Leu Leu Arg Gly Leu 

Ser Leu Trp Asn His Ala Glu Glu Arg Gin Lys Phe Phe Lys Tyr 

50 55 60 

Ser Val Asp Glu Lys Ser Asp Lys Glu Ala Glu Val Ser Glu His 

Ser Thr Gly He Thr His Leu Pro Pro Glu Val Met Leu Ser lie 

Phe Ser Tyr Leu Asn Pro Gin Glu Leu Cys Arg Cys Ser Gin Val 

95 100 105 

Ser Met Lys Trp Ser Gin Leu Thr Lys Thr Gly Ser Leu Trp Lys 

110 115 120 

His Leu Tyr Pro Val His Trp Ala Arg Gly Asp Trp Tyr Ser Gly 

Pro Ala Thr Glu Leu Asp Thr Glu Pro Asp Asp Glu Trp Val 111 

140 14 5 150 

Asn Arg Lys Asp Glu Ser Arg Ala Phe His Glu Trp Asp Glu Asp 

155 160 165 

Ala Asp He Asp Glu Ser Glu Glu Ser Ala Glu Glu Ser He Ala 

170 175 igo 

He Ser He Ala Gin Met Glu Lys Arg Leu Leu His Gly Leu He 

185 195 
Hxs Asn Val Leu Pro Tyr Val Gly Thr Ser Val Lys Thr Leu Val 

200 205 210 

Leu Ala Tyr Ser Ser Ala Val Ser Ser Lys Met Val Arg Gin He 

215 220 225 

Leu Glu Leu Cys Pro Asn Leu Glu His Leu Asp Leu Thr Gin Thr 

Asp He Ser Asp Ser Ala Phe Asp Ser Trp Ser Trp Leu Gly Cys 
245 250 255 

Cys Gin Ser Leu Arg His Leu Asp Leu Ser Gly Cys Glu Lys He 
260 265 270 

Thr Asp Val Ala Leu Glu Lys He Ser Arg Ala Leu Gly He Leu 

mu 275 280 285 

Thr Ser His Gin Ser Gly Phe Leu Lys Thr Ser Thr Ser Lys He 

mu 290 295 300 

Thr Ser Thr Ala Trp Lys Asn Lys Asp He Thr Met Gin Ser Thr 

305 310 315 

Lys Gin Tyr Ala Cys Leu His Asp Leu Thr Asn Lys Gly He Gly 



PCT/USflO/19948 
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320 










325 










330 


Glu 


Glu 


He 


Asp 


Asn 


Glu 


His 


Pro 


Trp 


Thr 


Lys 


Pro 


Val 


Ser 


Ser 






335 










340 










345 


Glu 


Asn 


Phe 


Thr 


Ser 


Pro 


Tyr 


Val 


Trp 


Met 


Leu 


Asp 


Ala 


Gfu 


Asp 








350 










3.55 










360 


Leu 


Ala 


Asp 


He 


Glu 


Asp 


Thr 


Val 


Glu 


Trp 


Arg~ His 


Arg. 


Asn 


Val. 








365 










370 










37 5 


Glu 


Ser 


Leu 


Cys 


Val 


Met 


Glu 


Thr 


Ala 


Ser 


Asn 


Phe 


Ser ,Cys 


Ser 








380 










385 










390 


Thr 


Ser >Gly Cys 


Phe 


Ser 


Lys 


Asp 


He 


Val 


Gly 


Leu 


Arg 


Thr 


Ser 










395 










400 










405 


Val Cys 


Trp 


Gin 


Gin 


His 


Cys 


Ala 


Ser 


Pro 


Ala 


Phe 


Ala 


Tyr 


Cys 








410 










415 










420 


Gly 


His 


Ser 


pne 


Cvs 


Cys 


Thr 


Gly 


Thr 


Ala 


Leu 


Arg 


Thr Met 


Ser 








425 










,430 










43 5. 


Ser 


Leu 


Pro 


pi 

Cj±U 


Ser 


Ser 


Ala 


Met 


Cys 


Arg 


Lys 


Ala 


Ala 


Arg 


Thr 






440 










445 










450 


Arg 


Leu 


Pro 


Arg 


Gly 


Lys 


Asp 


Leu 


Ile^ Tyr 


Phe 


Gly 


Ser 


Glu 


Lys 








455 










460 










465 


Ser 


Asp 


Gin 


Glu 


Thr 


Gly 


Arg 


Val 


Leu 


Leu 


Phe 


Leu 


Ser 


Leu 


Ser 








470 










475 










480 


Gly 


Cys 


Tyr 


Gin 


He 


Thr 


Asp. 


His 


Gly Leu Arg 


Val 


Leu 


Thr 


Leu 




485 








490 










495 


Gly Gly Gly 


Leu 


Pro 


Tyr 


Leu 


Glu 


His 


Leu 


Asn 


Leu 


Ser 


Gly Cys 










500 










505 










510 


Leu 


Thr 


He 


Thr 


Gly 


Ala 


Gly 


Leu 


Gin 


Asp 


Leu 


Val 


Ser 


Ala 


Cys 








515 










520 










525 


Pro 


Ser 


Leu 


Asn 


Asp 
530 


Glu 


Tyr 


Phe 


Tyr 


Tyr 
535 


Cys 


Asp 


Asn 


He 


Asn 
540 


Gly 


Pro 


His 


Ala 


Asp 


Thr 


Ala 


Ser 


Gly 


Cys 


Gin 


Asn 


Leu 


Gin 


Cys 








545 










550 










555 


Gly 


Phe 


Arg 


Ala 


Cys 


Cys 


Arg 


Ser 


Gly 


Glu 


















560 










565 













<210> 35 
<211> 228 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1851534CD1 

<400> 35 



Met 


Asp 


Phe 


Ser 


Phe 


Ser 


Phe 


Met 


Gin 


Gly 


He 


Met 


Gly Asn 


Thr 


1 






5 










10 










15 


He 


Gin 


Gin 


Pro 


Pro 


Gin 


Leu 


He 


Asp 


Ser 


Ala 


Asn 


He 


Arg 


Gin 








20 










25 










30 


Glu 


Asp 


Ala 


Phe 


Asp 


Asn 


Asn 


Ser 


Asp 


He 


Ala 


Glu 


Asp 


Gly 


Gly 








35 










40 










45 


Gin 


Thr 


Pro 


Tyr 


Glu 


Ala 


Thr 


Leu 


Gin 


Gin 


Gly 


Phe 


Gin 


Tyr 


Pro 








50 










55 










60 


Ala 


Thr 


Thr 


Glu 


Asp 


Leu 


Pro 


Pro 


Leu 


Thr 


Asn 


Gly 


Tyr 


Pro 


Ser 








65 










70 










75 


Ser 


He 


Ser 


Val 


Tyr 
80 


Glu 


Thr 


Gin 


Thr 


Lys 
85 


Tyr 


Gin 


Ser 


Tyr 


Asn 
90 


Gin 


Tyr 


Pro 


Asn 


Gly 


Ser 


Ala 


Asn 


Gly 


Phe 


Gly 


Ala 


Val 


Arg 


Asn 








95 










100 










105 


Phe 


Ser 


Pro 


Thr 


Asp 
110 


Tyr 


Tyr 


His 


Ser 


Glu 
115 


He 


Pro 


Asn 


Thr 


Arg 
120 


Pro 


His 


Glu 


He 


Leu 
125 


Glu 


Lys 


Pro 


Ser 


Pro 
130 


Pro 


Gin 


Pro 


Pro 


Pro 
135 


Pro 


Pro 


Ser 


Val 


Pro 
140 


Gin 


Thr 


Val 


He 


Pro 
145 


Lys 


Lys 


Thr 


Gly 


Ser 
150 
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Pro Glu He Lys Leu Lys He Thr Lys Thr He Gin Asn Gly Arg 
155 160 16 5 

Glu Leu Phe Glu Ser Ser Leu Cys Gly Asp Leu Leu Asn Glu Val 

Gin Ala Ser Glu . His Thr Lys Ser Lys "His: Glu Ser Arg Lys Glu f 



Lys Arg Lys Lys Ser Asn Lys 'His Asp Ser Ser Arg Ser Glu' Glu 
200 • 205; 210 

Arg Lys Ser His Lys' He Pro Lys Leu Glu Pro' Glu Glu Gin Asn 
215 220 1 ->->c 

Met Thr Lys . ■ « V. . , ' , < 225 

<210> 3 6' * c \ 

<211> 495. t 1 

<212> PRT *' 
<213> Homo sapiens 

<22 0> ■ . * ^ • ■' * ' . \ 

<221> misc_f eature * ' 

<223> Incyte ID No: 1868749CD1 V 
<400> 36 

Met Lys Gly Met Lys Val Glu Val Leu Asn Ser Asp Ala Val Leu 
Pro Ser Arg Val Tyr Trp He Ala Ser Val He Gin Thr Ala GlJ 

rru 20 25 30 

Tyr Arg Val Leu Leu Arg Tyr Glu Gly Phe Glu Asn Asp Ala Ser 
35 40 .j. 

His Asp Phe Trp Cys Asn Leu Gly Thr Val Asp Val His Pro He 
50 55 g 

Gly Trp Cys Ala lie Asn Ser Lys He Leu Val Pro Pro Arg Thr 

He His Ala Lys Phe Thr Asp Trp Lys Gly Tyr Leu Met Lys Arg 

Leu Val Gly Ser Arg Thr Leu Pro Val Asp Phe His He Lys Met 

Val Glu Ser Met Lys Tyr Pro Phe Arg Gin Gly Met Arg Leu Glu 
110 us 12Q 

Val Val Asp Lys Ser Gin Val Ser Arg Thr Arg Met Ala Val Val 

Asp Thr Val lie Gly Gly Arg Leu Arg Hu Leu Tyr Glu Asp Gly 
140 145 iso 

Asp Ser Asp Asp Asp Phe Trp Cys His Met Trp Ser Pro Leu He 
155 160 165 

His Pro Val Gly Trp Ser Arg Arg Val Gly His Gly He Lys Met 
170 175 180 

Ser Glu Arg Arg Ser Asp Met Ala His His Pro Thr Phe Arg Lys 

185 190 19=; 

He Tyr Cys Asp Ala Val Pro Tyr Leu Phe Lys Lys Val Arg Ala 

200 205 7ln 

val Tyr Thr Glu Gly Gly Trp Phe Glu Glu Gly Met Lys Leu Glu 
T , 215 220 2 25 

Ala He Asp Pro Leu Asn Leu Gly Asn He Cys Val Ala Thr Val 

230 235 240 

Cys Lys Val Leu Leu Asp Gly Tyr Leu Met He Cys Val Asp Glv 
„ 245 250 255 

Gly Pro Ser Thr Asp Gly Leu Asp Trp Phe Cys Tyr His Ala Ser 

260 265 270 

Ser His Ala He Phe Pro Ala Thr Phe Cys Gin Lys Asn Asp He 

275 280 285 

Glu Leu Thr Pro Pro Lys Gly Tyr Glu Ala Gin Thr Phe Asn Trp 

290 295 300 

Glu Asn Tyr Leu Glu Lys Thr Lys Ser Lys Ala Ala Pro Ser Arg 
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jUj 








Leu 


Phe 


Asn 


Met 


Asp 
320 


Cys 


Pro 


Asn 


Lys 


Leu 


Glu 


Ala 


Val 


Asp 


Leu 


Met 








335 








Ala 


Thr 


Val 


Lys 


Arg 
3 50 


val 


Val 


His 


Asp 


Gly. 


Trp 


Asp 


Ser 


Glu 


Tyr Asp 










"3 C C 

365 








Pro 


Asp 


He 


Tyr 


Pro 
380 


Val 


Gly 


Trp 


■Leu 


Gln- 


Pro 


Pro 


Val 
395 


Ala 


Ala 


Glu 


Lys 


Glu 


Ala 


Thr 


Lys 


Lys 


Lys 


Lys 








410 








Lys 


Arg 


He 


Pro 


Pro 


Thr 


Lys 


Thr 








425 








Lys 


Lys 


Pro 


Leu 


Leu 


Glu 


Asp 


Asp 






440 








Ser 


Ser 


Glu 


Pro 


Val 
455 


Pro 


Gly 


Glu 


Glu 


Glu 


His 


Leu 


Asp 
470 


Val 


Ala 


Ser 


Glu 


Leu 


Pro 


Val 


Ser 
485 


Val 


Glu 


Asn 



<210> 37 
<211> 1336 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1980010CD1 

<400> 37 

Met Val Asp Gin Leu Glu Gin He 

1 5 
Glu Lys His Gly Leu Glu Lys Pro 
20 

Gin Ser Ser Ser Glu Glu Ala Arg 
35 

Arg His Thr Gly Arg Lys Gin Pro 
50 

Arg Thr Leu Leu Gin Asp Met Leu 
65 

Thr Cys Leu Asp Ser Asp Ala Cys 
80 

Leu Leu Cys Ser Ser Arg Leu Glu 
95 

Met Met His Cys Ser Ala Cys Ser 
110 

Ala His Lys Gly Asn Pro His Tyr 
125 

He Asp Leu Val Leu Ala Ala Ser 
140 

Thr Asn Leu Thr Asp Ser Cys Met 
155 

Gin Leu He Thr Asp Arg Pro Pro 
170 

Leu He Gin Ala Val Gly Cys Leu 
185 

Leu Pro Leu Gin Val Arg Leu Cys 





310 










315 




325 


Phe 


Lys 


Val 


Glv 

vja y 


Met 
330 




Pro 

340, 


Arg 


Leu 


He 


Cvs 

v- y o 


Val 
345 


Arg 


Leu 

J -J J 


Leu 


Ser 


He 


His 


Phe 
3 60 




Trp 
370 


v d J_ 


Asp 


v-.y to 


Glu' 


Ser 
375 


Cys - 


La J.U 

385 


Leu 


1 XIX 






Gin 
390 


Pro 


nla 

400 


1 I IX. 


Pro 


Leu 


T ,\rc 

*~>y a 


Ala 
405 


Lys 


Vji.il 


irlltr 




Lys 


Lys 


Arg 


415 










420 


Arg 


Pro 
430 


Leu 


ax y 




Gly 


Ser 
435 


Pro 


v?in 
445 




Al 3 
rli. d 


Arg 


T ,vc; 


He 
450 


He 


He 
460 


Ala 


Val 


Arg 


Val 


Lys 
465 


Pro 


Asp 
475 


Lys 


Ala 


Ser 


Ser 


Pro 
480 


He 


Lys 
490 


Gin 


Glu 


Thr 


Asp 


Asp 
495 



Leu 


Ser 


Val 


Ser 


Glu 


Leu 


Leu 




10 










15 


He 


Ser 


Phe 


Val 


Lys 


Asn 


Thr 




25 










30 


Lys 


Leu 


Met 


Val 


Arg 


Leu 


Thr 


40 










45 


Pro 


Val 


Ser 


Glu 


Ser 


His 


Trp 




55 










60 


Thr 


Met 


Gin 


Gin 


Asn 


Val 


Tyr 




70 










75 


Tyr 


Glu 


He 


Phe 


Thr 


Glu 


Ser 


85 










90 


Asn 


He 


His 


Leu 


Ala 


Gly Gin 




100 










105 


Glu 


Asn 


Pro 


Pro 


Ala 


Gly 


He 




115 










120 


Arg 


Val 


Ser 


Tyr 


Glu 


Lys 


Ser 


130 










135 


Arg 


Glu 


Tyr 


Phe 


Asn 


Ser 


Ser 


145 










150 


Asp 


Leu 


Ala 


Arg 


Cys 


Cys 


Leu 




160 










165 


Ala 


He 


Gin 


Glu 


Glu 


Leu 


Asp 




175 










180 


Glu 


Glu 


Phe 


Gly 


Val 


Lys 


He 




190 










195 


Pro 


ASp 


Arg 


He 


Ser 


Leu 


He 
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200 205 210 

Lys Glu Cys lie Ser Gin Ser Pro Thr Cys Tyr Lys Gin Ser Thr 

Lys Leu Leu Gly Leu A\a Glu Leu Leu Arg Val AU Gly Glu III 

230 . 235- ' ' 240 ' ' 

Pro Glu Glu Arg Arg Gly Gin Val Leu lie Leu Leu Val Glu Gin ' 
245 ' 250 ' 255 

Ala Leu Arg Phe His Asp Tyr 'Lys Ala Ala Ser Met His /Cys Gin 

Glu Leu Met Ala Thr Gly . Tyr Pro Lys Ser Trp Asp Va/ Cys III 
. ' 275 * ? . 280 '"> >285 

Gin Leu Gly Gin Ser Glu Gly, Tyr Gin Asp Leu Ala Thr Arg Gin 

290 . ' "295 • ! 3 00 ' 

Glu Leu Met Ala JJ| Ala Leu Thr His Cys Pro Pro Ser Ser He ; 

Glu Leu Leu Leu Ala Ala Ser Ser Ser Leu Glri Thr' Glu He Leu ' ' ' 

320" ' 325 -' 

Tyr Gin Arg Val Asn Phe Gin He' His His Glu Gly Gly Glu : Asn 

335 340 345 

He Ser Ala Ser Pro Leu Thr Ser Lys Ala Val Gin' Glu Asp Glu 

350 355 3so 

Val Gly Val Pro Gly Ser Asn Ser Ala Asp Leu Leu Arg Trp Thr 
365 370 375 

Thr Ala Thr Thr Met Lys Val Leu Ser Asn Thr Thr Thr Thr Thr 
380 385 390 

Lys Ala Val Leu Gin Ala Val Ser Asp Gly Gin Trp Trp Lys Lys 
395 400 4Q5 

Ser Leu Thr Tyr Leu Arg Pro Leu Gin Gly Gin Lys Cys Gly Gly 
410 415 420 

Ala Tyr Gin He Gly Thr Thr Ala Asn Glu Asp Leu Glu Lys Gin 
425 4 30 435 

Gly Cys His Pro Phe Tyr Glu Ser Val He Ser Asn Pro Phe Val 
440 445 450 

Ala Glu Ser Glu Gly Thr Tyr Asp Thr Tyr Gin His Val Pro Val 
455 460 465 

Glu Ser Phe Ala Glu Val Leu Leu Arg Thr Gly Lys Leu Ala Glu 
470 475 480 

Ala Lys Asn Lys Gly Glu Val Phe Pro Thr Thr Glu Val Leu Leu 
485 490 495 

Gin Leu Ala Ser Glu Ala Leu Pro Asn Asp Met Thr Leu Ala Leu 
sl 500 505 5 t o 

Ala Tyr Leu Leu Ala Leu Pro Gin Val Leu Asp Ala Asn Arg Cys 
515 5 20 525 

Phe Glu Lys Gin Ser Pro Ser Ala Leu Ser Leu Gin Leu Ala Ala 

Tyr Tyr Tyr Ser Leu Gin He Tyr Ala Arg Leu Ala Pro Cys Phe 
545 550 555 

Arg Asp Lys Cys His Pro Leu Tyr Arg Ala Asp Pro Lys Glu Leu 
560 565 570 

He Lys Met Val Thr Arg His Val Thr Arg His Glu His Glu Ala 

Trp Pro Glu Asp Leu He Ser Leu Thr Lys Gin Leu His Cys Tyr 
590 595 600 

Asn Glu Arg Leu Leu Asp Phe Thr Gin Ala Gin He Leu Gin Gly 

Leu Arg Lys Gly Val Asp Val Gin Arg Phe Thr Ala Asp Asp Gin 

Tyr Lys Arg Glu Thr He Leu Gly Leu Ala Glu Thr Leu Glu Glu 
635 64 0 645 

Ser Val Tyr Ser lie Ala He Ser Leu Ala Gin Arg Tyr Ser Val 

Ser Arg Trp Glu Val Phe Met Thr His Leu Glu Phe Leu Phe Thr 
665 670 675 
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Asp Ser Gly Leu Ser Thr. Leu Glu He Glu Asn Arg Ala Gin Asp 
680 685 690 

Leu His Leu Phe Glu Thr Leu Lys Thr Asp Pro Glu Ala Phe His 
695 7 0° 705 

Gin: His Met Val Lys Tyr He Tyr Pro Thr lie Gly Gly Phe Asp 

710 715 z 
His Glu Arg Leu Gin Tyr Tyr Phe Thr Leu Leu Glu Asn Cys Gly 
725 730 /jd 
Cys Ala Asp Leu Gly Asn Cys Ala lie Lys Pro Glu Thr His Il< 
740 745 : /5 ° 
' Arg Leu Leu Lys Lys Phe Lys Val Val Ala Ser Gly Leu AsnTyf 
- " 760 '-/bo 



755 



Lys Lys Leu Thr ( Asp Glu Asn Met Ser Pro S Leu Glu Ala Leu Glu 
770 ; 77 5 . • 7 , ? 

Pro Val Leu Ser Ser Gin Asn- He Leu Ser lie Ser Lys. Leu .Val 
785 790- 795 

Pro Lys lie Pro Glu Lys Asp Gly Gin Met Leu Ser ; Pro Ser Ser 
800 805 . 810 

Leu Tyr Thr He Trp Leu Gin Lys. Leu Phe Trp Thr Gly Asp Pro 
815 820 825 

His Leu He Lys Gin Val . Pro Gly Ser Ser Pro Glu Trp Leu His 
830 835 840 

Ala Tyr Asp Val Cys Met Lys Tyr Phe Asp Arg Leu His Pro Gly 

845 850 ? 

Asp Leu He Thr Val Val Asp Ala Val Thr Phe Ser Pro Lys Ala 
860 865 870 

Val Thr Lys Leu Ser Val Glu Ala Arg Lys Glu Met Thr Arg Lys 
875 880 885 

Ala He Lys Thr Val Lys His Phe He Glu Lys Pro Arg Lys Arg 
890 895 900 

Asn Ser Glu Asp Glu Ala Gin Glu Ala Lys Asp Ser Lys Val Thr 
905 91° 915 

Tyr Ala Asp Thr Leu Asn His Leu Glu Lys Ser Leu Ala His Leu 

920 925 
Glu Thr Leu Ser His Ser Phe He Leu Ser Leu Lys Asn Ser Glu 
935 940 945 

Gin Glu Thr Leu Gin Lys Tyr Ser His Leu Tyr Asp Leu Ser Arg 
950 955 960 

Ser Glu Lys Glu Lys Leu His Asp Glu Ala Val Ala He Cys Leu 
965 970 9,75 

Asp Gly Gin Pro Leu Ala Met He Gin Gin Leu Leu Glu Val Ala 

Val Gly Pro Leu Asp He Ser Pro Lys Asp He Val Gin Ser Ala 
995 1000 1005 

He Met Lys He He Ser Ala Leu Ser Gly Gly Ser Ala Asp Leu 

1010 1015 1020 

Gly Gly Pro Arg Asp Pro Leu Lys Val Leu Glu Gly Val Val Ala 
Y 1025 1030 1035 

Ala Val His Ala Ser Val Asp Lys Gly Glu Glu Leu Val Ser Pro 

1040 1045 1050 

Glu Asp Leu Leu Glu Trp Leu Arg Pro Phe Cys Ala Asp Asp Ala 

1055 1060 1065 

Trn Pro Val Arg Pro Arg He His Val Leu Gin He Leu Gly Gin 

1070 1075 1080 

Ser Phe His Leu Thr Glu Glu Asp Ser Lys Leu Leu Val Phe Phe 

1085 1090 1095 

Arg Thr Glu Ala He Leu Lys Ala Ser Trp Pro Gin Arg Gin Val 

1100 H05 IHO 

Asp He Ala Asp He Glu Asn Glu Glu Asn Arg Tyr Cys Leu Phe 

1115 1120 H25 

Met Glu Leu Leu Glu Ser Ser His His Glu Ala Glu Phe Gin His 

1130 1135 1140 

Leu Val Leu Leu Leu Gin Ala Trp Pro Pro Met Lys Ser Glu Tyr 
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Val lie Thr Asn^Asn Pro Trp Val Arg"" Ala Thr Val Met^eu 
Thr Arg cys Thr Met Glu Asn Lys Glumly Leu Gly Asn Glu"™ . 

: 1175 ' 1180 ii:o = 

Leu Lys Met Cys Arg Ser Leu 'Tyr Asn Thr Lys . Gin- Met leu Pro 
1190, ' 1 , n i 

Ala Glumly Val Lys Glu. Leu- Cys Leu Leu Leii Leu AsnGln Ser " * ' 
T T 1 . 205 1210 12 1*5-. 

Leu Leu Leu. Pro ; Ser Leu Lys Leu Leu . Leu 6lu Ser Arg Asp Glu 
* 122 0 " 1225 19 7m 

His Leu His- Glu^Met Ala Leu ^Glu <Gln He Thr Ala" Val Thr Thr/ 

Val Asn Asp Ser^Asn Cys Asp Girl Glu" IS Leu Ser Leu Leu^eu " t 

Asp Ala Lys Leu Leu Val Lys cys Val^r Thr Pro Phe Tyr^ro 
, '■' 5 1270' 127 <5 

Arg, lie val Asp His Leu .Leu Ala Ser Leu Gin Gin Gly Arg Trp 
1280 1285 loon 

Asp Ala Glu Glu^eu Gly Arg His Leu Arg Glu Ala Gly His Glu 

Ala Glu Ala Glyjer Leu Leu Leu Ala"™ Arg Gly Thr His"^ 
Ala Phe Arg Thr Phe Ser Thr Ala Leu"^ Ala Ala Gin His^rp 
Val 1325 1330 1335 

<210> 38 

<211> 934 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 2259032CD1 



<400> 38 

Met Phe Trp Lys Phe Asp Leu Asn Thr Thr Ser His Val Asp Lys 

10 

Leu Leu Asp Lys Glu His Val Thr Leu Gin Glu Leu Met Asp G^u 

Asp Asp He Leu Gin Glu Cys Lys Ala Gin Asn Gin Lys Leu Leu 

Asp Phe Leu Cys Arg Gin Gin Cys Met Glu Glu Leu Val Ser Leu 
50 55 60 

'he 
75 
>ro 

Gin He Ser Asp Arg Leu Gly Gly Asp Glu Ser Leu Leu Ser Leu 
95 100 105 

, , „ - - Leu Asn Pro Leu 

10 H5 120 

Ser Lys Thr He Gly Asn Leu He Ala Arg _ _ 
125 130 135 



He Thr Gin Asp Pro Pro Leu Asp Met Glu Glu Lys Val Arg Phe 
65 7 0 __ 



Lys Tyr Pro Asn Thr Ala Cys Glu Leu Leu Thr Cys Asp Val Pro 

80 * 



85 



Leu Tyr Asp Phe Leu Asp His Glu Pro Pro Leu Asn Pro Leu Leu 
Ala Ser Phe Phe Ser Lys Thr He Gly Asn Leu He Ala Arg Lys 
Thr Glu Gin Val lie Thr Phe Leu Lys Lys Lys Asp Lys Phe He 
Ser Leu Val Leu Lys His He Gly Thr Ser Ala Leu Met Asp Leu 



Leu Leu Arg Leu Val Ser Cys Val Glu Pro Ala Gly Leu Arg Gin 

Trp 

185 190 195 



Asp val Leu His Trp Leu Asn Glu'ciu HI Val lie Gin Arg Leu 
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Val Glu Leu He His Pro Ser Gin Asp Glu Asp Arg Gin Ser Asm 
200 205 210 

Ala Ser Gin Thr Leu Cys Asp Her Val Arg Leu Gly Arg Asp»Gln 
215 220 225 

Gly Ser Gin Leu Gin Glu Ala Leu Glu Pro. Asp Pro Leu Leu Thr 
^ y i: 230 , '' - 235" ^ 240 

Ala -Leu Glu Ser Arg cin Asp Cys Val Glu Gin Leu Leu Lys Asn 
245 250 255 

Met Phe 'Asp Gly Asp Arg Thr Glu 'Ser Cys Leu Val Ser Gly Thr 
260 265 270 

Gin Val Leu Leu Thr Leu Leu Glu Thr .Arg Arg Val Gly Thr Glu 

275 280 / 28 

Gly Leu Val Asp Ser Phe Ser Gin Gly Leu Glu Arg Ser Tyr Ala 
290 295 300 

Val Ser Ser Ser Val Leu His Gly He Glu Pro Arg Leu . Lys Asp 
V 305 " 310 315 

Phe His Gin Leu Leu Leu Asn Pro Pro Lys Lys Lys Ala He Leu 
.* 320 325 330 

Thr Thr He Gly Val Leu Glu Glu Pro Leu Gly Asn Ala Arg Leu 
335 340 345 

His Gly Ala Arg Leu Met Ala Ala Leu Leu His Thr Asn Thr Pro 
350 355 360 

Ser He Asn Gin Glu Leu Cys Arg Leu Asn Thr Met Asp Leu Leu 
365 370 375 

Leu Asp Leu Phe Phe Lys Tyr Thr Trp Asn Asn Phe Leu His Phe 
380 385 390 

Gin Val Glu Leu Cys He Ala Ala He Leu Ser His Ala Ala Arg 
395 400 405 

Glu Glu Arg Thr Glu Ala Ser Gly Ser Glu Ser Arg Val Glu Pro 

410 415 
Pro His Glu Asn Gly Asn Arg Ser Leu Glu Thr Pro Gin Pro Ala 
425 430 435 

Ala Ser Leu Pro Asp Asn Thr Met Val Thr His Leu Phe Gin Lys 
440 445 450 

Cys Cys Leu Val Gin Arg He Leu Glu Ala Trp Glu Ala Asn Asp 
455 460 465 

His Thr Gin Ala Ala Gly Gly Met Arg Arg Gly Asn Met Gly His 
470 475 480 

Leu Thr Arg He Ala Asn Ala Val Val Gin Asn Leu Glu Arg Gly 
485 490 495 

Pro Val Gin Thr His He Ser Glu Val He Arg Gly Leu Pro Ala 
500 505 510 

Asp Cys Arg Gly Arg Trp Glu Ser Phe Val Glu Glu Thr Leu Thr 
515 520 b ^ 

Glu Thr Asn Arg Arg Asn Thr Val Asp Leu Ala Phe Ser Asp Tyr 
530 535 540 

Gin He Gin Gin Met Thr Ala Asn Phe Val Asp Gin Phe Gly Phe 
545 550 555 

Asn Asp Glu Glu Phe Ala Asp Gin Asp Asp Asn He Asn Ala Pro 
560 565 570 

Phe Asp Arg He Ala Glu He Asn Phe Asn He Asp Ala Asp Glu 
575 580 585 

Asp Ser Pro Ser Ala Ala Leu Phe Glu Ala Cys Cys Ser Asp Arg 
590 595 600 

He Gin Pro Phe Asp Asp Asp Glu Asp Glu Asp He Trp Glu Asp 
605 610 615 

Ser Asp Thr Arg Cys Ala Ala Arg Val Met Ala Arg Pro Arg Phe 
620 625 630 

Gly Ala Pro His Ala Ser Glu Ser Cys Ser Lys Asn Gly Pro Glu 
Y 635 640 645 

Arg Gly Gly Gin Asp Gly Lys Ala Ser Leu Glu Ala His Arg Asp 
650 655 660 

Ala Pro Gly Ala Gly Ala Pro Pro Ala Pro Gly Lys Lys Glu Ala 
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Pro Pro Val Glu Gly Asp Ser Glu Ala Gly Ala Met Trp Thr til 
-, ^ 680 6 85 cq n 

Val Phe Asp Glu Pro Ala Asn Ser THr Pro Thr Ala Pro Gly Val 
1 i . 695 - : 700 ' ' • 7 n£- 

Val Arg Asp, Val Gly Ser Ser Val Trp . Ala Ala Gly. Thr Ser Ala 

Pro Glu G^Lys Gly Trp Ala Lys pL IH AjSp Phe Gin Pro Phe 

Cys Cys Ser Glu sJ Gly Pro Arg Cys Ser Ser Pro Val Asp Thr 

G*u Cys Ser His Ala 'Glu Gly Ser Arg Ser Gin, Gly Pro Glu* Lys 

Ala Phe Ser Pro Ala Ser Pro Cys Ala. Trp Asn Val Cys Val Thr 
» : T , 70 ~ 1 ■ 7 75 7 o 0 

Arg. Lys Ala Pro' Leu Leu Ala Ser Asp Ser >Ser, Ser Ser Gly Gly 
' ' 785 '" 790 7 q£ 

Ser His Ser Glu Asp Gly Asp Gin Lys Ala Ala Ser Ala" Met Asp 

'« oOO 805 
Ala Val Ser Arg Gly Pro Gly Arg Glu Ala Pro Pro Leu Pro Thr* 
" 8X5 820 825 

835 ~ 

Ser Arg Leu Leu Ser Pro Ala Cys Pro Ala Pro Lys Glu Val Thr 
845 850 



Val Ala Arg Thr Glu Glu Ala Val Gly Arg Val Gly Cys Ala Asp 



Ala Ala Pro Ala Val Ala Val Pro Pro Glu Ala Thr Val Ala lie 

8 60 



865 



Thr Thr Ala Leu Ser Lys Ala Gly Pro Zl He Pro Thr Pro 111 



875 



880 



Val Ser Ser Ala Leu Ala Val Ala Val Pro Leu Gly Pro He Met 



890 



895 



Ala Val Thr Ala Ala Pro Ala Met Val 111 Thr Leu Gly Thr 



905 



910 



Thr Lys Asp Gly Lys Thr Asp Ala Pro Pro Glu Gly Ala Ala Leu 



Asn Gly Pro Val 



920 



925 



930 



<210> 39 
<211> 515 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2359526CD1 



<400> 39 

Met Ala Ala Asn Met Tyr Arg Val Gly Asp Tyr Val Tyr Phe Glu 

Asn Ser Ser Ser Asn Pro Tyr Leu He Arg Arg lie Glu Glu Leu 

Asn Lys Thr Ala Ser Gly Asn Val Glu A^a Lys Val Val Cys Phe 



40 



Tyr Arg Arg Arg Asp He Ser Asn Thr Leu He Met Leu Ala Asp 
Lys His Ala Lys Glu He Glu Glu Glu Ser Glu Thr Thr Val gIu 



" 70 



Ala Asp Leu Thr Asp Lys Gin Lys His Gin Leu Lys His Arg gIu 

80 g5 
Leu Phe Leu Ser Arg. Gin Tyr Glu Ser Leu Pro Ala Thr His lie 

Arg Gly Lys Cys Ser Val Ala Leu Leu lln Glu Thr Glu Ser Val 
110 I" 120 
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Ser 
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Ser 


Leu 








TIC 

2 /b 








Tyr 


Gly 


Lys 


Asp 


Phe 


Asn 


Asp 


lie 
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Ser 
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*D A C 
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3 20 
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Cys 
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"3 O O 
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Pro 
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3 95 
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Trp 
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Trp 


Lys 


Lys 


Tyr 










410 
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Ser 


Glu 


Glu 


Glu 


Lys 


Leu 


Ser 










42b 








Pro 


Arg 


Val 


Arg 


Ser 


His 


Val 


Ser 










440 








Pro 


Val 


Arg 


Asn 


Thr 


Gly 


Ser 


Pro 










4bb 








Gin 


Ala 


Phe 


Phe 


Leu 


His 


Thr 


Thr 










470 








Gin 


Val 


Cys 


Lys 


Asn 


Thr 


Leu 


Arg 










485 








Pro 


Phe 


Val 


Ala 


He 


Asn 


Tyr 


Ala 










500 








Met 


Leu 


Leu 


Asn 


Ser 
















515 









<210> 40 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2456494CD1 
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1 *7 <^ ' 
1 / 3 
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o o c 
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Ser 
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^ b u 
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Glu 
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2 ob 










270 


Phe 


Glu 
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1 Oft 
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Trp 




b 










3 00 
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Tyr 
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Trp 


Lys 


Thr 




310 










315 


Arg 
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Lys 


Ala 


Ala 


Glu 


Ala 




325 










330 


He 


Pro 


Thr 


Tyr 


Ser 


Lys 


Pro 




340 










345 


Asn 


Gly 


Lys 


Pro 


Gly Ala 


Val 




355 










3 60 
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Pro 


Gin 
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Pro 
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Leu 




370 
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Ala 


Thr 


Gin 
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His 


Gin 


Trp 




385 
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Arg 
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Cys 
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He 




400 
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Gly Gly 
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Lys 
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Pro 


Thr 




415 
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Pro 
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Pro 


Thr 


Thr 


Glu 


Asp 




430 
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Arg 


Gin 


Ala 


Met 


Gin 


Gly 


Met 


445 










450 


Lys 


Ser 


Ala 


IT- 1 

val 


Lys 


Thr 


Arg 


460 










465 


Tyr 


Phe 


Thr 


Lys 


Phe 


Ala 


Arg 




475 










480 


Leu 


Arg 


Gin 


Ala 


Ala 


Arg 


Arg 




490 










495 


Ala 


He 


Arg 


Ala 


Glu 


Cys 


Lys 




505 










510 
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<400> ^40 

Met Val fl Asp Glu Leu Val Leu Leu Leu His Ala Leu Leu Met Arg 

rT . 1 5 10 15 

His Arg Ala Leu Ser He Glu Asn Ser Gin Leu Met Glu pin Leu 

" 2< ? ■ -m * 25 : . 30 

Arg > Leu. Leu Val Cys Glu Arg Ala ,Ser Leu Leu Af g Gin (Val Arg 

■* * * 35. v 40 

Pro Proi Ser Cys Pro 4 Val Pro Phe > Pro Glu Thr ' Phe Asn Gly 'Glu 

50 55 . 60 



PCT/US00/19948 



Ser ser Arg Leu Pro Glu Phe lie Val Gin Thr Ala Ser Tyr Met 

65 v ; . . ' . 70 : : n 75 

Leu Val Asn Glu Asn Arg Phe Cys Asn Asp, Ala Met Lys Val Ala 

80 85 ■ 



65 V 70 75 

Ma 

Phe Leu lie Ser Leu Leu Thr Gly Glu Ala Glu Glu Trp*' Val Val 
•Pro. Tyr He Glu Met Asp Ser Pro lie Leu Gly Asp Tyr Arg Ala 

Phe Leu Asp Glu Met Lys Gin Cys Phe Gly Trp Asp Asp Asp Glu 
125 130 135 

Asp Asp Asp Asp Glu Glu Glu Glu Asp Asp Tyr 
140 145 

<210> 41 

<211> 580 

<212> PRT 

< 213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2668536CD1 

<400> 41 

Met Lys Glu Asn Lys Glu Asn Ser Ser Pro Ser Val Thr Ser Ala 
Asn Leu Asp His Thr Lys Pro Cys Trp Tyr Trp Asp Lys Lys Asp 

Leu Ala His Thr Pro Ser Gin Leu Glu Gly Leu Asp Pro Ala Thr 
3 5 40 - c- 

Glu Ala Arg Tyr Arg Arg Glu Gly Ala Arg Phe lie Phe Asp Val 
50 55 g0 

Gly Thr Arg Leu Gly Leu His Tyr Asp Thr Leu Ala Thr Gly He 

He Tyr Phe His Arg Phe Tyr Met Phe His Ser Phe Lys Gin Phe 
80 85 on 

Pro Arg Tyr Val Thr Gly Ala Cys Cys Leu Phe Leu Ala Gly Lys 

Val Glu Glu Thr Pro Lys Lys Cys Lys Asp He He Lys Thr Ala 

110 H5 120 

Arg Ser Leu Leu Asn Asp Val Gin Phe Gly Gin Phe Gly Asp Asp 

Pro Lys Glu Glu Val Met Val Leu Glu Arg He Leu Leu Gin Thr 

140 145 150 

He Lys Phe Asp Leu Gin Val Glu His Pro Tyr Gin Phe Leu Leu 

155 160 165 

Lys Tyr Ala Lys Gin Leu Lys Gly Asp Lys Asn Lys He Gin Lys 

170 175 mo 

Leu Val Gin Met Ala Trp Thr Phe Val Asn Asp Ser Leu Cys Thr 

185 190 195 

Thr Leu Ser Leu Gin Trp Glu Pro Glu He He Ala Val Ala Val 

200 205 210 

Met Tyr Leu Ala Gly Arg Leu Cys Lys Phe Glu He Gin Glu Trp 

Thr Ser Lys Pro Met Tyr Arg Arg Trp Trp Glu Gin Phe Val Gin 
230 235 240 
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Ser 
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290 
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Lys 


He 
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He 
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350 










355 








» • • 


360 


Thr 


His 


Pro 


Pro 


Leu 


Pro 


Pro 


Ala 


His 


Pro 


Pro 


Pro 


Asp 


Arg 


Lys , 










365 










370 










375 


Pro 


Pro 


Leu 


Ala 


Ala 


Ala 


Leu 


Gly 


Glu 


Ala 


Glu 


Pro 


Pro 


Gly 


Pro , 










3 80 










385 










390 


Val 


Asp 


Ala 


Thr 


Asp 


Leu 


Pro 


Lys 


Val- 


Gin 


He 


Pro 


Pro 


Pro 


Ala 










395 










400 










405 


His 


Pro 


Ala 


Pro 


Val 


His 


Gin 


Pro 


Pro 


Pro 


Leu 


Pro 


His 


Arg 


Pro 










410 










415 










420 


Pro 


Pro 


Pro 


Pro 


Pro 


Ser 


Ser 


Tyr 


Met 


Thr 


Gly 


Met 


Ser 


Thr 


Thr 










425 










430 










435 


Ser 


Ser 


Tyr 


Met 


Ser 


Gly 


Glu 


Gly 


Tyr 


Gin 


Ser 


Leu 


Gin 


Ser 


Met 










440 










445 










450 


Met 


Lys 


Thr 


Glu 


Gly 


Pro 


Ser 


Tyr 


Gly 


Ala 


Leu 


Pro 


Pro 


Ala 


Tyr 










455 










460 










465 


Gly 


Pro 


Pro 


Ala 


His 


Leu 


Pro 


Tyr 


His 


Pro 


His 


Val 


Tyr 


Pro 


Pro 










470 










475 










480 


Asn 


Pro 


Pro 


Pro 


Pro 


Pro 


Val 


Pro 


Pro 


Pro 


Pro 


Ala 


Ser 


Phe 


Pro 










485 










490 










495 


His 


Leu 


Pro 


Ser 


His 


Pro 


Leu 


Leu 


Leu 


Ala 


Thr 


Pro 


Asn 


Pro 


His 










500 










505 










510 


Pro 


Pro 


Thr 


Thr 


Pro 


Thr 


Ser 


His 


Pro 


His 


Pro 


His' Ala 


Ser 


Arg 










515 










520 










525 


Leu 


Pro 


Thr 


Gin 


Ser 


Pro 


Leu 


He 


Leu 


Leu 


Gin 


Gly 


Trp 


Ala 


Cys 










530 










535 










540 


Arg 


Gin 


Pro 


Ala 


Thr 


His 


Leu 


Leu 


Pro 


Ser 


Pro 


Leu 


Glu 


Asp 


Ser 










545 










550 










555 


Leu 


Leu 


Cys 


Pro 


Arg 


Pro 


Phe 


Pro 


His 


Pro 


Ala 


Cys 


Leu 


Gin 


Leu 










560 










565 










570 


Glu 


Gly 


Leu 


Gly 


Arg 


Ala 


Ala 


Trp 


Met 


Arg 




















575 










580 












<210> 42 


























<211> 131 


























<212> PRT 


























<213> Homo sapiens 






















<220> 




























<221> misc„ 


Eeature 






















<223> Incyte ID 


No: 


2683225CD1 
















<400> 42 


























Met 


Ala 


Glu 


Pro 


Asp 


Tyr 


He 


Glu 


Asp 


Asp 


Asn 


Pro 


Glu 


Leu 


He 


1 








5 










10 










15 


Arg 


Pro 


Gin 


Lys 


Leu 


He 


Asn 


Pro 


Val 


Lys 


Thr 


Ser 


Arg 


Asn 


His 










20 










25 










30 


Gin 


Asp 


Leu 


His 


Arg 


Glu 


Leu 


Leu 


Met 


Asn 


Gin 


Lys 


Arg 


Gly 


Leu 










35 










40 










45 


Ala 


Pro 


Gin 


Asn 


Lys 


Pro 


Glu 


Leu 


Gin 


Lys 


Val 


Met 


Glu 


Lys 


Arg 
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50 








55 




60 




Lys 


Arg 


Asp 


Gin 


Val 
65 


He 


Lys 


Gin Lys 


Glu 
70 


Glu Glu Ala Gin 


Lvs 
■ LJ jf ° 

75 




Lys 


Lys 


Ser 


Asp 


Leu 
80 


Glu 


He 


Glu Leu 


Leu 

,85 


Lys Arg Girt Gin 


Lys 

90 y 




Leu 


Glu 


Gin 


Leu 


Glu 


Leu 


Glu 


Lys* Gin 


Liys 


Leu / Gin Glu jGlu 


Gin 












95 




r 


100 




105 


* r 


Glu 


Asn 


Ala 


Pro 


Glu 
1*10 


Phe 


Val 


Lys Val 


Lys 
115 


Gly Asn Leu Arg 


Arg 
120 




Thr 


Gly 


Gin 


Glu 


Val 
125 


Ala 


Gin 


Ala Gin 


Glu 
130 


Ser 







<210> 43 
<211> 812 
<212> PRT 

<213> Homo sapiens 



<22~0> , ) . i 

<221> misc_f eature 

<223> Incyte ID No: 2797839CD1 . *~ . * 

<400> 43 



Meu. 


vjxy 


Arg 


Lys 


Leu 


Asp 


Pro 


Thr 


Lys 


Glu 


Lys 


Arg 


Gly 


Pro 


Gly 


1 








b 










10 










15 


Arg 


Lys 


Ala 

Ala 


Arg 


Lys 


G±n 


T 1 ri-m 

Lys 


Gly 


A±a 


Glu 


Thr 


Glu* 


Leu 


Val 


Arg 




















2 5 










30 


rue 


Leu 


Pro 


A±a 


vax 


ber 


Asp 


Glu 


Asn 


Ser 


Lys 


Arg 


Leu 


Ser 


Ser 










J b 










40 










45 


Arg 


nla 


Arg 


T a/c 

juys 


Arg 


Aid 


i\±a 


Lys 


Arg 


Arg 


Leu 


Gly 


Ser 


Val 


Glu 










-> u 




















60 


Ala 


Pro 


Lys 


i nr 


Asn 


Lys 


ber 


fro 


CaXU 


Ala 


Lys 


Pro 


Leu 


Pro 


Gly 




















1 0 










75 


Lys 


Leu 


Pro 


Lys 


\j±y 


x ±e 


Ser 


Aia 


Lily 


Ala 


val 


Gin 


Thr 


Ala 


Gly 










on 
o u 










o b 










90 


Lvs 


Lvs 


Gly 


Pro 


Gin 


Ser 


Leu 


Phe 


Asn 


Ala 


Pro 


Arg 


m v 

ui. y 


Lys 


Lys 










95 










100 










105 


Arg 


Pro 


Ala 


Pro 


Gly 


Ser 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Ser 










110 










115 










120 


Glu 


Glu 


Asp 


Gly 


Met 


Val 


Asn 


His 


Gly 


Asp 


Leu 


Trp 


Gly 


Ser 


Glu 










125 










130 










135 


Asp 


Asp 


Ala 


Asp 


Thr 


Val 


Asp 


Asp 


Tyr 


Gly 


Ala 


Asp 


Ser 


Asn 


Ser 










140 










145 










150 


Glu 


Asp 


Glu 


Glu 


Glu 


Gly 


Glu 


Ala 


Leu 


Leu 


Pro 


He 


Glu 


Arg 


Ala 










155 










160 








165 


Ala 


Arg 


Lys 


Gin 


Lys 


Ala 


Arg 


Glu 


Ala 


Ala 


Ala 


Gly 


He 


Gin 


Trp 










170 










175 










180 


Ser 


Glu 


Glu 


Glu 


Thr 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Lys 


Glu 


Val 


Thr 










185 










190 










195 


Pro 


Glu 


Ser 


Gly 


Pro 


Pro 


Lys 


Val 


Glu 


Glu 


Ala 


Asp 


Gly 


Gly 


Leu 










200 










205 








210 


Gin 


He 


Asn 


Val 


Asp 


Glu 


Glu 


Pro 


Phe 


Val 


Leu 


Pro 


Pro 


Ala 


Gly 










215 










220 










225 


Glu 


Met 


Glu 


Gin 


Asp 


Ala 


Gin 


Ala 


Pro 


Asp 


Leu 


Gin 


Arg 


Val 


His 










230 










235 










240 


Lys 


Arg 


lie 


Gin 


Asp 


He 


Val 


Gly 


He 


Leu 


Arg 


Asp 


Phe 


Gly 


Ala 










245 










250 








255 


Gin 


Arg 


Glu 


Glu 


Gly 


Arg 


Ser 


Arg 


Ser 


Glu 


Tyr 


Leu 


Asn 


Arg 


Leu 










260 










265 










270 


Lys 


Lys 


Asp 


Leu 


Ala 


He 


Tyr 


Tyr 


Ser 


Tyr 


Gly 


Asp 


Phe 


Leu 


Leu 










275 










280 










285 


Gly 


Lys 


Leu 


Met 


Asp 


Leu 


Phe 


Pro 


Leu 


Ser 


Glu 


Leu 


Val 


Glu 


Phe 










290 










295 










300 


Leu 


Glu 


Ala 


Asn 


Glu 


Val 


Pro 


Arg 


Pro 


Val 


Thr 


Leu 


Arg 


Thr 


Asn 



305 310 315 
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Thr 


Leu 


Lys 


Thr 


Arg 


Arg 


Arg 


Asp 


Leu 


Ala 


Gin 


Ala 


Leu 


He 


Asn 










320 










325 










330 


Arg 


Gly 


Val 


Asn 


Leu 


Asp 


Pro 


Leu 


Gly 


Lys 


Trp 


Ser 


Lys 


Thr 


Gly 






3 3 5' 










340 










345 


Leu 


Val 


Val 


Tyr 


Asp 


Ser 


Ser 


Val 


Pro 


He. 


Gly Ala 


Thr 


Pro*"* 


Glu 








350' 










355 










360 


Tvr 

-r Jr 


Leu 


Ala 


Gly His 


Tyr ' 


Met 


Leu 


Gin 


Gly 


Ala . 


Ser 


Ser 


Met 


Leu 








365 










370 










3 7 f 5 


Pro 


Val 


Met 


Ala 


Leu 


Ala 


Pro 


Gin 


Glu 


His 


Glu 


Arg 


He 


Leu v 


Asp 










380 










385 










390 


Met 


Cvs 


Cvs 

^* Jr *-* 


Ala 


Pro 


Gly 


Gly 


Lys 


Thr 


Ser 


Tyr 


Met 


Ala 


Gin 


Leu 






395 










400 










405 


Met 


LVS 


Asn 


Thr Gly Val 


lie ? Leu 


Ala 


Asn 


Asp 


Ala 


Asn 


Ala 


Glu 








a *i rt 
41U 










415 










420 


Arg. 


Leu 


Lvs 


Ser 


vai 


Val 


Gly Asn 


Leu 


His 


Arcr 


Leu 


Gly 


Val 


:Thr 
















430 










435 


Ash 


Thr 


lie 


lie 


Ser 


rilS 


Tyr 


Asp 


Gly 


Arg 


Gin 


Phe 


Pro 


Lys ;Val 










A A (\ 

44 0 










445 










450 


Val 


Glv 


Glv 

VJJ - Jr 


Phe 


Asp 


Arg 


vai 


Leu 


Leu 


Asp 


Ala 


Pro 


Cys 


Ser 


Gly 










/ICC 

4 bb 










460 










465 


Thr 


Glv 


Val 


lie 


Ser 


Lys 


Asp 


Pro 


Ala 


Val 


Lys 


Thr 


Asn 


Lys 


Asp 








4 7 0 










475 










480 


Glu 


Lys 


Asp 


lie 


Leu 


Arg 


Cys 


Ala 


His 


Leu 


Gin 


Lys 


Glu 


Leu 


Leu 








485 










490 










495 


Leu 


Ser 


Ala 


lie 


Asp 


Ser 


Val 


Asn 


Ala 


Thr 


Ser 


Lys 


Thr 


Gly Gly 










500 










505 










SI 0 

J1U 


Tvr 

-Y 


Leu 


Val 


Tyr 


Cys 


Thr 


Cys 


Ser 


lie 


Thr 


Val 


Glu 


Glu 


Asn 


Glu 








bib 










520 










525 




Val 


Val 


Asp 


Tyr 


Ala 


Leu 


Lys 


Lys 


Arg 


Asn 


Val 


Arg 


Leu 


Val 








b J o 










535 










540 


Pro 


Thr 


Glv 

w Jr 


Leu 


Asp 


fne 


pi -*r 

biy 


bin 


Glu 


Gly 


Phe 


Thr 


Arg 


Phe 


Arg 








b 4 b 










550 










555 


Glu 


Arcr 


Arg 


T-l Vl -a 

rile 


nlS 


Pro 




Leu 


Arg 


Ser 


Thr 


Arg 


Arg 


Phe 


Tyr 
















565 










570 


Pro 


His 


Thr 


MIS 


Asn 


Met- 
ric L. 


Asp 


uiy 


Phe 


Phe 


He 


Ala 


Lys 


Phe 


Lys 










j / D 










580 










585 


Lys 


Phe 


Ser 


Asn 


Ser 


Tip 

lie 


rt u 




Ser 


Gin 


Thr 


Glv 


Asn 


Ser 


Glu 


















595 










600 


Thr 


Ala 


Thr 


Pro 






Val 


Asp 


Leu 


Pro 


Gin 


Val 


He 


Pro 


Lys 










a ns 










610 










615 


Ser 


Glu 


Asn 


Ser 


Cay 
OCX, 


Gin 


Pro 


Ala 


Lys 


Lys 


Ala 


Lys 


Gly 


Ala 


Ala 










con 
o ^ u 










625 










630 


Lys 


Thr 


Lvs 


Gin 


Gin 


Leu 


Gin 


Lys 


Gin 


Gin 


His 


Pro 


Lys 


Lys 


Ala 






635 










640 










645 


Ser 


Phe 


Gin 


Lys 


Leu 


Asn 


Gly 


lie 


Ser 


Lys 


Glv 


Ala 


Asp 


Ser 


Glu 










650 










655 










660 


Leu 


Ser 


Thr 


Val 


Pro 


Ser 


Val 


Thr 


Lys 


Thr 


Gin 


Ala 


Ser 


Ser 


Ser 










665 










670 










675 


Phe 


Gin 


Asp 


Ser 


Ser 


Gin 


Pro 


Ala 


Gly 


Lys 


Ala 


Glu 


Gly 


He 


Arg 








680 










685 










690 


Glu 


Pro 


Lys 


Val 


Thr 


Gly 


Lys 


Leu 


Lys 


Gin 


Arg 


Ser 


Pro 


Lys 


Leu 










695 










700 










705 


Gin 


Ser 


Ser 


Lys 


Lys 


Val 


Ala 


Phe 


Leu 


Arg 


Gin 


Asn 


Ala 


Pro 


Pro 










710 










715 










720 


Lys 


Gly 


Thr 


Asp 


Thr 


Gin 


Thr 


Pro 


Ala 


Val 


Leu 


Ser 


Pro 


Ser 


Lys 








725 










730 










735 


Thr 


Gin 


Ala 


Thr 


Leu 


Lys 


Pro 


Lys 


Asp 


His 


His 


Gin 


Pro 


Leu 


Gly 










740 










745 










750 


Arg 


Ala 


Lys 


Gly 


Val 


Glu 


Lys 


Gin 


Gin 


Leu 


Pro 


Glu 


Gin 


Pro 


Phe 








755 










760 










765 


Glu 


Lys 


Ala 


Ala 


Phe 


Gin 


Lys 


Gin 


Asn 


Asp 


Thr 


Pro 


Lys 


Gly 


Pro 








770 










775 










780 


Gin 


Pro 


Pro 


Thr 


Val 


Ser 


Pro 


lie 


Arg 


Ser 


Ser 


Arg 


Pro 


Pro 


Pro 



41/93 



WO <M*W71 PCT/USOO/19948 

. 7 85 790 795 

Ala Lys Arg Lys Lys Ser Gin Ser Arg Gly Asn Ser Gin Leu Leu 

800 805 sio -f 

Leu Ser * oxu 

c f 

<210> 44*'' < ~ v ' r ■ V K 

<211> 537 " . c ' : 

<212> PRT ( ( i 1 



<213> Homo sapiens v , ' ' * 

' f r 

<220>* , X * ^ 

: <221> misc_f eature v ' 

<223> Incyte ID No : r 2959521CDl . 

<400> 44 , 1 , ' 

Met: Arg Gly Val Gly Ala Arg Val Tyr Ala Asp Ala Pro Ala Lys 
1 \ 5 - V 10 15 

Leu Leu Leu Pro Pro v Pro Ala Ala. Trp Asp Leu Ala Val Arg Leu 
20 2 5 30 

Arg Gly Ala Glu Ala Ala Ser Glu Arg Gin Val Tyr Ser Val Thr 
35 40 45 

Met Lys Leu Leu Leu Leu His Pro ^la Phe Gin Ser Cys Leu Leu 
50 55 : 5 o 

Leu Thr Leu Leu Gly Leu Trp Arg Thr Thr Pro Glu Ala His Ala 
65 70 75 

Ser Ser Leu Gly Ala Pro Ala lie Ser Ala Ala Ser Phe Leu Gin 
80 85 90 

Asp Leu He His Arg Tyr Gly Glu Gly Asp Ser Leu Thr Leu Gin 
95 100 105 

Gin Leu Lys Ala Leu Leu Asn His Leu Asp Val Gly Val Gly Arcr 
110 H5 120 

Gly Asn Val Thr Gin His Val Gin Gly His Arg Asn Leu Ser Thr 
125 130 135 

Cys Phe Ser Ser Gly Asp Leu Phe Thr Ala His Asn Phe Ser Glu 
140 145 iso 

Gin Ser Arg He Gly Ser Ser Glu Leu Gin Glu Phe Cys Pro Thr 
155 16Q 165 

He Leu Gin Gin Leu Asp Ser Arg Ala Cys Thr Ser Glu Asn Gin 
170 175 iso 

Glu Asn Glu Glu Asn Glu Gin Thr Glu Glu Gly Arg Pro Ser Ala 
I 85 190 195 

Val Glu Val Trp Gly Tyr Gly Leu Leu Cys Val Thr Val He Ser 
200 205 210 

Leu Cys Ser Leu Leu Gly Ala Ser Val Val Pro Phe Met Lys Lys 
215 220 225 

Thr Phe Tyr Lys Arg Leu Leu Leu Tyr Phe He Ala Leu Ala He 
230 235 240 

Gly Thr Leu Tyr Ser Asn Ala Leu Phe Gin Leu He Pro Glu Ala 
245 250 255 

Phe Gly Phe Asn Pro Leu Glu Asp Tyr Tyr Val Ser Lys Ser Ala 
260 265 270 

Val Val Phe Gly Gly Phe Tyr Leu Phe Phe Phe Thr Glu Lys He 
275 280 285 

Leu Lys He Leu Leu Lys Gin Lys Asn Glu His His His Gly His 
290 295 300 

Ser His Tyr Ala Ser Glu Ser Leu Pro Ser Lys Lys Asp Gin Glu 
305 310 315 

Glu Gly Val Met Glu Lys Leu Gin Asn Gly Asp Leu Asp His Met 
320 325 330 

He Pro Gin His Cys Ser Ser Glu Leu Asp Gly Lys Ala Pro Met 
335 340 345 

Val Asp Glu Lys Val He Val Gly Ser Leu Ser Val Gin Asp Leu 
350 355 360 
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Gin Ala Ser Gin Ser Ala Cys Tyr Trp Leu Lys Gly Val Arg Tyr 

365 370 375 

Ser Asp lie Gly Thr Leu Ala Trp Met lie Thr Leu Ser Asp Gly 

380 385 390 

Leu His Asn Phe He Asp Gly Leu .Ala He Gly Ala Ser Phe Thr 
s ,395 . 400 405 

Val Ser Val Phe: Gin Gly He Ser Thr Ser Val- Ala He Leu Cys 

410 " 415 420 

Glu Glu Phe Pro His Glu Leu Gly Asp Phe Val lie Leu Leu Asn 

425 430 435 

Ala Gly Met Ser He Gin Gin Ala Leu Phe Phe Asn Phe Leu Ser 

440 445 " - 450 

Ala Cys Cys Cys^Tyr Leu Gly Leu Ala Phe Gly He Leu Ala Gly . 

455 460 465 

Ser His Phe Ser Ala Asn Trp He Phe Ala Leu Ala Gly Gly Met 

470 475 480 

Phe' Leu Tyr He Ser Leu Ala Asp Met Pheiv Pro Glu Met Asn Glu 

485 490 495 

Val Cys Gin Glu Asp Glu Arg Lys Gly Ser lie Leu He Pro Phe 

500 505 510 

He He Gin Asn ' Leu Gly Leu Leu Thr Gly Phe Thr He Met Val 
5 515 520 525 

Val Leu Thr Met Tyr Ser Gly Gin He Gin He Gly 

530 535 

<210> 45 
<211>- 584 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3082014CD1 

<400> 45 

Met Leu Trp Gly Gly Arg Val Gly Leu Thr Gly Val Phe Gin Ser 
15 10 15 

Leu Ser Tyr Arg Gly Lys Cys Ser Val Thr Leu Leu Asn Glu Thr 
20 25 30 

Asp He Leu Ser Gin Tyr Leu Glu Lys Glu Asp Cys Phe Phe Tyr 
35 40 45 

s e r Leu Val Phe Asp Pro Val Gin Lys Thr Leu Leu Ala A:sp Gin 
50 55 60 

Gly Glu He Arg Val Gly Cys Lys Tyr Gin Ala Glu He Pro Asp 
65 70 75 

Arg Leu Val Glu Gly Glu Ser Asp Asn Arg Asn Gin Gin Lys Met 
80 85 90 

Glu Met Lys Val Trp Asp Pro Asp Asn Pro Leu Thr Asp Arg Gin 
95 100 105 

He Asp Gin Phe Leu Val Val Ala Arg Ala Val Gly Thr Phe Ala 
110 115 120 

Arg Ala Leu Asp Cys Ser Ser Ser He Arg Gin Pro Ser Leu His 
125 130 135 

Met Ser Ala Ala Ala Ala Ser Arg Asp He Thr Leu Phe His Ala 
140 145 150 

Met Asp Thr Leu Gin Arg Asn Gly Tyr Asp Leu Ala Lys Ala Met 
155 160 165 

Ser Thr Leu Val Pro Gin Gly Gly Pro Val Leu Cys Arg Asp Glu 
170 175 180 

Met Glu Glu Trp Ser Ala Ser Glu Ala Met Leu Phe Glu Glu Ala 
185 190 195 

Leu Glu Lys Tyr Gly Lys Asp Phe Asn Asp He Arg Gin Asp Phe 
200 205 . 210 

Leu Pro Trp Lys Ser Leu Ala Ser He Val Gin Phe Tyr Tyr Met 
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215 220 225 



Trp Lys Thr Thr Asp Arg Tyr He Gin Gin Lys Arg Leu Lys Ala 
230 235 240- 

Ala Glu Ala Asp Ser Lys Leu Lys Gin Val Tyr - He Pro Thr Tyr 
' 245 * 250 255 

Thr Lys Pro ( -Asn Pro Asn Gin lie He Ser Val Gly Ser Lys Pro 
-v 260 265'* 270 

Gly Met Asn Gly Ala Gly/ fee Gin Lys > Gly Leu Thr /Cys Glu Ser 
' 275 i 'x 280 1 . '285 

Cys His. Thr Thr Gin Ser bia Gin y Trp Tyr Ala Trp Gly Pro ,Pro 



290 ' 29-5 300 

: Ash Met Gin Cys Arg Leu t Cys Ala Ser Cys Trp lie /Tyr Trp Lys 

315 
Thr 
330 



Lys Tyr Gly, Gly Leu LysvThr Pro Thr Gin Leu Glu -Gly Ala Thr 

.320 < ' 325 * . 330 

Arg Gly Thr Thr Glu Pro, His Ser Arg Gly His Leu Ser Arg Pro 

v 335 / ■ .340 - 



305 ' „ ^ . 310 315 

Thr 
330 
Pro 

Glu Ala' Gln Ser Leu Ser Pro Tyr Thr Thr Ser Ala Asn Arg Ala 
350 .* " ' 355 . 360 

Lys Leu Leu Ala Lys Asn Arg Gin Thr Phe Leu Leu Gin Thr Thr 
* 365 370 375 

Lys Leu Thr Arg Leu Ala Arg Arg Met Cys Arg Asp Leu Leu Gin 
t, » 380 385 390 

Pro Arg Arg Ala Ala Arg Arg Pro Tyr Ala Pro He Asn Ala Asn 
395 400 405 

Ala He Lys Ala Glu Cys Ser He Arg Leu Pro Lys Ala Ala Lys 

410 415 * 

Thr Pro Leu Lys He His Pro Leu Val Arg Leu Pro Leu Ala Thr 
425 43 0 435 

He Val Lys Asp Leu Val Ala Gin Ala Pro Leu Lys Pro Lys Thr 
440 445 450 

Pro Arg Gly Thr Lys Thr Pro He Asn Arg Asn Gin Leu Ser Gin 
455 460 465 

Asn Arg Gly Leu Gly Gly He Met Val Lys Arg Ala Tyr Glu Thr 
470 475 48Q 

Met Ala Gly Ala Gly Val Pro Phe Ser Ala Asn Gly Arg Pro Leu 
485 4 9 o 4g5 

Ala Ser Gly He Arg Ser Ser Ser Gin Pro Ala Ala Lys Arg Gin 
500 505 510 

Lys Leu Asn Pro Ala Asp Ala Pro Asn Pro Val Val Phe Val Ala 
515 520 525 

Thr Lys Asp Thr Arg Ala Leu Arg Lys Ala Leu Thr His Leu Glu 
530 535 54Q 

Met Arg Arg Ala Ala Arg Arg Pro Asn Leu Pro Leu Lys Val Lys 
545 550 555 

Pro Thr Leu He Ala Val Arg Pro Pro Val Pro Leu Pro Ala Pro 
560 5 6 5 5?0 

Ser His Pro Ala Ser Thr Asn Glu Pro He Val Leu Glu Asp 
575 580 

<210> 46 
<211> 425 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3520701CD1 

<400> 46 

Met Ala Gly Ala Glu Gly Ala Ala Gly Arg Gin Ser Glu Leu Glu 

« - 5 10 15 

Pro Val Val Ser Leu Val Asp Val Leu Glu Glu Asp Glu Glu Leu 
20 25 3Q 



44/93 



WO 01/07471 



PCT/USOO/19948 





Asn 


Glu 


Ala 


Cvs 


Ala 


Val 


Leu 


Gly Gly 


Ser 


Asp 


Ser 


Glu 


Lys 








35 










40 










45 


Cys 


Ser- Tyr 


Car 
SCI 


Gin 


Glv 

vj J. y 


Ser 


Val 


Lys 




Gin 
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<212> PRT 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 4184320CD1 
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<400> 47 

Met Tyr Val Arg Val Ser Phe Asp Thr Lys Pro Asp Leu Leu Leu 

1 £ 10 15 ? 

His Leu Met Thr Lys'Glu. Trp Gin Leu Glu Leu Pro Lys * Leu Leu " * 

- 20 $ " 25 v <30 J > 

lie Ser Val His Gly Gly Leu Gin Asn Phe Glu Leu feln Pro : Lys - 

' 35 ; ' 40 " ■ r 45 " : - • 

Leu Lys Gin Val Phe Gly Lys Gly Leu lie Lys-' 'Ala Ala Met Thr ' 

50 : 55 ; ' so- 

Thr Gly- Ala Trp He Phe Thr. Gly. Gly Val Asn Thr Gly Val He ' 

65 1 - 70 75 : 

Arg His Val Gly Asp Ala Leu Lys. Asp His Ala Ser Lys Ser Arg ^ / 

' ' * 8 -° " , - " 85 SO 

Gly Lys He Cys Thr He Gly He Ala Pro Trp Gly He Val Glu ' 

95 • " ' 100 105 ' 

Asn Gin Glu Asp Leu • He Gly Arg Asp Val Val Arg Pro Tyr Gin 
' f I 10 ' . ■ 1 r - ' 115 < 120' 

Thr Met Ser Asn Pro Met Ser Lys Leu Thr Val Lsu Asn Ser Met 
/ t ' 125 , 130 c 1 . 135 

His £er His Phe He Leu Ala Asp Asn Gly Thr Thr Gly Lys Tyr " 

' 140 145 , 150 

Gly Ala Glu Val Lys Leu Arg A:rg Gin Leu Glu Lys His He Ser 
155 160 165 

Leu Gin Lys He Asn Thr Arg Cys Leu Pro Phe Phe Ser Leu Aso 
170 175 i8o 

Ser Arg Leu Phe Tyr Ser Phe Trp Gly Ser Cys Gin Leu Asp Ser 
185 190 195 

Val Gly He Gly Gin Gly Val Pro Val Val Ala Leu lie Val Glu 
200 205 210 

Gly Gly Pro Asn Val He Ser He Val Leu Glu Tyr Leu Arg Asp 
215 220 225 

Thr Pro Pro Val Pro Val Val Val Cys Asp Gly Ser Gly Arg Ala 
230 235 240 

Ser Asp He Leu Ala Phe Gly His Lys Tyr Ser Glu Glu Gly Gly 
245 250 255 

<210> 48 

<211> 111 

<212> PRT 

<2 13> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4764233CD1 

<400> 48 

Met Ser Trp Arg Gly Arg Ser Thr Tyr Arg Pro Arg Pro Arg Arg 

5 1° 15 

Ser Leu Gin Pro Pro Glu Leu He Gly Ala Met Leu Glu Pro Thr 

20 25 30 
Asp Glu Glu Pro Lys Glu Glu Lys Pro Pro Thr Lys Ser Arg Asn 

35 4 0 45 

Pro Thr Pro Asp Gin Lys Arg Glu Asp Asp Gin Gly Ala Ala Glu 

50 55 6Q 

He Gin Val Pro Asp Leu Glu Ala Asp Leu Gin Glu Leu Cys Gin 

65 70 75 

Thr Lys Thr Gly Asp Gly Cys Glu Gly Gly Thr Asp Val Lys Glv 

80 85 90 

Lys He Leu Pro Lys Ala Glu His Phe Lys Met Pro Glu Ala Gly 

95 100 105 

Glu Gly Lys Ser Gin Val 

110 

<210> 49 
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<211> 422 
<212> PRT 

<213> Homo sapiens 

<220> ... 

<221> misc_feature 

<223> Incyte ID No: 4817352CD1 , 

<400> 49 
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Arg Asp Arg-Arg Pro Leu Leu Thr Ala Pro Asp His Cys Ser Asp 
410 415 420 

Asp Ala 

<210> '50 ' . . 4 " * . r 

<211> 397 , r - ■ i V 

<212>: PRT ; ' 
<213> Homo sapiens 

<22 0> i . 

<221> misc_f eature * . v 

<223> Incyte ID No: 5040573CD1 ^ ^ 

<400> 50 v ? * f 

Met Ala Met He Glu Leu' Gly. Phe Gly Arg Gin Asn Phe His Pro 
V 5 10 15 

'Leu 'Lys Arg Lys Ser Ser Leu, Leu iLeu Lys Leu rie Ala Val Val 
i 20 . 25 , 30 

Phe Ala Val Leu Leu Phe Cys Glu .Phe Leu Her Tyr Tyr Leu Ala 
35 40 1 45 

He Phe Gin Cys Asn Trp Pro Glu Val Lys Thr Thr Ala Ser Asp 
50 55 so 

Gly Glu Gin Thr Thr Arg Glu Pro Val Leu Lys Ala Met Phe Leu 
65 70 75 

Ala Asp Thr His Leu Leu Gly Glu Phe Leu Gly His Trp Leu Asp 
80 85 90 

Lys Leu Arg Arg Glu Trp Gin Met Glu Arg Ala Phe Gin Thr Ala 
95 100 105 

Leu Trp Leu Leu Gin Pro Glu Val Val Phe He Leu Gly Asp He 
11° 115 120 

Phe Asp Glu Gly Lys Trp Ser Thr Pro Glu Ala Trp Ala Asp Asp 
125 130 135 

Val Glu Arg Phe Gin Lys Met Phe Arg His Pro Ser His Val Gin 
140 145 150 

Leu Lys Val Val Ala Gly Asn His Asp He Gly Phe His Tyr Glu 
155 160 165 

Met Asn Thr Tyr Lys Val Glu Arg Phe Glu Lys Val Phe Ser Ser 
170 175 180 

Glu Arg Leu Phe Ser Trp Lys Gly He Asn Phe Val Met Val Asn 
185 190 195 

Ser Val Ala Leu Asn Gly Asp Gly Cys Gly He Cys Ser Glu Thr 
200 205 210 

Glu Ala Glu Leu He Glu Val Ser His Arg Leu Asn Cys Ser Arg 
215 220 225 

Glu Gin Ala Arg Gly Ser Ser Arg Cys Gly Pro Gly Pro Leu Leu 
230 235 240 

Pro Thr Ser Ala Pro Val Leu Leu Gin His Tyr Pro Leu Tyr Arg 
245 250 255 

Arg Ser Asp Ala Asn Cys Ser Gly Glu Asp Ala Ala Pro Pro Glu 
260 265 270 

Glu Arg Asp He Pro Phe Lys Glu Asn Tyr Asp Val Leu Ser Arg 
275 280 285 

Glu Ala Ser Gin Lys Leu Leu Trp Trp Leu Gin Pro Arg Leu Val 
290 295 300 

Leu Ser Gly His Thr His Ser Ala Cys Glu Val His His Gly Gly 
305 310 315 

Arg Val Pro Glu Leu Ser Val Pro Ser Phe Ser Trp Arg Asn Arg 
320 325 330 

Asn Asn Pro Ser Phe He Met Gly Ser lie Thr Pro Thr Asp Tyr 
335 340 345 

Thr Leu Ser Lys Cys Tyr Leu Pro Arg Glu Asp Val Val Leu He 
350 355 360 

He Tyr Cys Gly Val Val Gly Phe Leu Val Val Leu Thr Leu Thr 



48/93 



WO 01/07471 



PCT/US00/19948 



365 370 375 

His Phe Gly Leu Leu Ala' Ser Pro Phe Leu Ser Gly Leu Asn Leu 

380 385 390 

Leu Gly Lys Arg Lys Thr Arg . ■ i- 

395 

<210> 51 ■ " . f 

<2li> 800 • ■ . 

<212> PRT* - : 

■<213> Homo sapiens : ■ ; 

<220> - ' . . ' • • 

<221> misc. feature " 
<223> Incyte ID No: 5627029CD1 f 

• _ . ' - ■ • . . . r 

<400> 51- ' • " ■ " " 
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Arg Glu Arg Glu Leu Glu Glu lie Arg Ala Lys Leu Arg Leu Gin 
365 370 375 

Ala Gin Ser Leu Ser Thr Val Gly Pro Arg Leu Ala Ser Glu Tyr 

380 385 390 

Leu Thr Pro Glu Glu Met Val .^Thr Phe ' Lys Lys Thr Lys Arg Arg ' 

? 395 ' i : .400 . ■ * 405 

Val Lys Lys lie. Arg Lys Lys Glu Lys* Glu Val Val Val Arg Ala '< 

410 v ^ 415 420 

Asp Asp Leu Leu Pro Leu Gly' Asp Glrf Thr -Gin Asp Gly Asp Phe^ 

" 425 , 430.' ^ ' 435 

Gly Ser "Arg Leu Arg Gly Arg Gly Arg Arg iLrg Val Ser Glu Val - 

440 . 4 445 450 

Glu Glu Glu Lys Glu Pro Val 1 Pro Gin Pro Leu Pro Ser' Asp Asp 
- 455 * 460 465. 

Thr Arg Val Glu Asn Met Asp lie Ser Asp Glu Glu Glu Gly Gly' 

470 , 475 480 t 

Ala Pro Pro Pro Ala Sex Pro' Gin Val Leu Glu Glu Asp Glu Ala' 
* 485 . 490 ' 495 

Glu Leu Glu Leu Gin Lys Gin Leu Glu Lys Gly Arg Arg Leu Arg ^ 

500 - 505 510- 

Gln Leu Gin Gin Leu Gin Gin Leu Arg Asp Ser Gly Glu Lys Val 

515 520 525 

Val Glu He Val Lys Lys Leu Glu Ser Arg Gin Arg Gly Trp Glu 

530 535 540 

Glu Asp Glu Asp Pro Glu Arg Lys Gly Ala He Val Phe Asn Ala 

545 550 555 

Thr Ser Glu Phe Cys Arg Thr Leu Gly Glu lie Pro Thr Tyr Gly 

560 565 570 

Leu Ala Gly Asn Arg Glu Glu Gin Glu Glu Leu Met Asp Phe Glu 

575 580 585 

Arg Asp Glu Glu Arg Ser Ala Asn Gly Gly Ser Glu Ser Asp Gly 

590 595 600 

Glu Glu Asn lie Gly Trp Ser Thr Val Asn Leu Asp Glu Glu Lys 

605 610 615 

Gin Gin Gin Asp Phe Ser Ala Ser Ser Thr Thr He Leu Asp Glu 

620 625 630 

Glu Pro He Val Asn Arg Gly Leu Ala Ala Ala Leu Leu Leu Cys 

635 640 645 

Gin Asn Lys Gly Leu Leu Glu Thr Thr Val Gin Lys Val Ala Arg 

650 655 660 

Val Lys Ala Pro Asn Lys Ser Leu Pro Ser Ala Val Tyr Cys He 

665 670 675 

Glu Asp Lys Met Ala He Asp Asp Lys Tyr Ser Arg Arg Glu Glu 

680 685 690 

Tyr Arg Gly Phe Thr Gin Asp Phe Lys Glu Lys Asp Gly Tyr Lys 

695 700 705 

Pro Asp Val Lys He Glu Tyr Val Asp Glu Thr Gly Arg Lys Leu 

710 715 720 

Thr Pro Lys Glu Ala Phe Arg Gin Leu Ser His Arg Phe His Gly 

725 730 735 

Lys Gly Ser Gly Lys Met Lys Thr Glu Arg Arg Met Lys Lys Leu 

740 745 750 

Asp Glu Glu Ala Leu Leu Lys Lys Met Ser Ser Ser Asp Thr Pro 

755 760 765 

Leu Gly Thr Val Ala Leu Leu Gin Glu Lys Gin Lys Ala Gin Lys 

770 775 780 

Thr Pro Tyr He Val Leu Ser Gly Ser Gly Lys Ser Met Asn Ala 

785 790 795 

Asn Thr He Thr Lys 

800 

<210> 52 
<211> 713 
<212> PRT 
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<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5678487CD1 
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Gin val Glu Lys Val Thr.Lys Glu Lys He Ser Ala He Asn Gin * 

42 5 430 435 

Leu Glu Glu He Gin Ser Gin Leu Ala Ser Arg Glu Met Asp Val 

440 445 450 
Thr Lys Val Cys\Gly Glu Met, Arg Tyr Gin ' Leu Asn Lys .Thr Asn ' ' 

455 r 1 460 ' 4 6 c 

Met Glu Lys Asp. Glu Ala Glu Lys Glu Hi's Arg Glu Phe Arg Ala 

470 ' * 4 ?5 - 1 480 

Lys Thr Asn Arg Asp Leu Glu l Ile Lys Asp Gin Glu He Glu Lys' ; * 

485 - 490 4 | 5 



. Leu Arg He. Glu Leu Asp 'Glu Ser Lys Gin His Leu <Glu 'G1A Glu * 
*" ' ~ v 5 °P 1 ' 505 f sin ■ 

Gin Gin Lys Ala Ala Leu Ala Arg Glu Glu Cys Leu Arg Leu Thr 
515 ' " 5 20' 1 525 

Glu Leu Leu Gly Glu Ser Glu His Gin lieu His, Leu Thr Arg Gin v 

. ' * 53 9 535 540 

Glu Lys Asp Ser G\n Gin Ser Phe Ser Lys v GW Ala Lys Ala 

- < 545 * - ^-550 v 555 

Gin Ala Leu Gin A|a Gin Gin Arg Glu Gin Glu Leu Thr Gin Lys 

He Gin Gin Met Glu A1A Gin His Asp Lys Thr Glu Asn Glu Gin" x 

^ t 575 580 585 

Tyr Leu Leu Leu Thr Ser Gin Asn Thr Phe Leu Thr Lys Leu Lys 

Glu Glu Cys Cys Thr Leu Ala Lys Lys Leu Glu Gin He Ser Gin 
605 610 6i5 

Lys Thr Arg Ser Glu He Ala Gin Leu Ser Gin Glu Lys Arg Tyr 

Thr Tyr Asp Lys Leu Gly Lys Leu Gin Arg Arg Asn Glu Glu Leu 
635 640 645 

Glu Glu Gin Cys Val Gin His Gly Arg Val His Glu Thr Met Lys 

650 65 $ 660 

Gin Arg Leu Arg Gin Leu Asp Lys His Ser Gin Ala Thr Ala Gin 

665 670 675 

Gin Leu Val Gin Leu Leu Ser Lys Gin Asn Gin Leu Leu Leu Glu 

680 685 690 

Arg Gin Ser Leu Ser Glu Glu Val Asp Arg Leu Arg Thr Gin Leu 

695 700 7Q5 

Pro Ser Met Pro Gin Ser Asp Cys 

710 

<210> 53 

<211> 880 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5682976CD1 
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Gin Glu Gly Val Ser Ala Glu Asn Pro Val Glu Asn His He. Asn 

5 ? 5 580 585 

He Thr Gin Ser Asp Lys Phe Thr Ala Lys Pro Leu Asp Ser Asn t 

590 595 * 600 - 

Ser Gly Glu Arg Asn Asp Leu Asn Leu Asp Arg Ser Cys Gly Val • 

r s 605 t 610 *. ; 615 

Pro Glu Glu Ser. Ala Ser Ser Glu Lys Ala Lys Glu Pro Glu Thr r- ' 

t ' 620 ea5 * 630 * 

Ser* Asp Gin Thr Ser Thr Glu Ser Ala Thr , Asn Glu , Asn Asn Thr > - ? 

635 . 640 645 

Asn Pro Glu *Pro Gin Phe. Gin Thr .Glu Ala Thr Gly Pro' Ser Ala' \ 

r ' 650 -655 : ^660 . 

His Glu Glu Thr Ser Thr Arg : Asp Ser Ala Leu Gin Asp Thr Asp' ! 

665 670 675' 

Asp Ser Asp Asp Asp Pro Val Leu He Pro -Gly 1 Ala Arg Tyr Arg '* 

680 685 '* 690 

Ala Gly Pro Gly Asp Arg Arg Ser ; Ala Val Ala Arg He Gin Glu 

69,5 * 700 < i 705 

Phe Phe Arg Arg Arg Lys. Glu Arg Lys Glu Met Glu Glu = Leu Asp ' 

■nu r * - 710 v 715 720 

Tnr Leu Asn He Arg Arg Pro Leu Val Lys Met Val Tyr Lys Gly t 

725 730 735 

His Arg Asn Ser Arg Thr Met He Lys Glu Ala Asn Phe Trp Gly 

740 745 750 

Ala Asn Phe Val Met Ser Gly Ser Asp Cys Gly His He Phe He 

755 760 765 

Trp Asp Arg His Thr Ala Glu His Leu Met Leu Leu Glu Ala Asp 

770 775 780 

Asn His Val Val Asn Cys Leu Gin Pro His Pro Phe Asp Pro He 

785 790 795 

Leu Ala Ser Ser Gly He Asp Tyr Asp He Lys He Trp Ser Pro 

800 805 810 

Leu Glu Glu Ser Arg He Phe Asn Arg Lys Leu Ala Asp Glu Val 

815 820 825 

He Thr Arg Asn Glu Leu Met Leu Glu Glu Thr Arg Asn Thr He 

830 835 840 

Thr Val Pro Ala Ser Phe Met Leu Arg Met Leu Ala Ser Leu Asn 

845 850 855 

His He Arg Ala Asp Arg Leu Glu Gly Asp Arg Ser Glu Gly Ser 

860 865 870 

Gly Gin Glu Asn Glu Asn Glu Asp Glu Glu 

875 880 

<210> 54 
<211> 855 
<212> PRT 

< 2 1 3 > Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5992432CD1 
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80 85 90 

Ala Tyr Glu Asp Val Asn Asn Cys His Glu Arg Ala Phe Val Phe 
95 100 105 

Met His Lys Met Pro Arg Leu Trp Leu Asp 'Tyr Cys Gin Phe Leu 
110 115 ■ 120 

Met Asp- Gin Gly Arg Val Thr His Thr Arg Arg Thr Phe Asp Arg t 

125 ' 130 r 135 

Ala Leu Arg Ala Leu Pro He Thr Gin His Ser Arg. He Trp Pro 
. 140 145 • 150 

Leu Tyr Leu Arg Phe Leu Arg: ser His Pro Leu Pro Glu Thr Ala 
155 160- 165 

Val Arg Gly Tyr Arg Arg Phe. Leu Lys Leu Ser Pro Glut Ser Ala. 

170 - 175 t 180 

Glu Glu Tyr He Glu 'Tyr Leu Lys Ser Ser Asp Arg Leu; Asp Glu 
: 185 190 195 

Ala Ala Gin Arg Leu Ala Thr Val Val Asn Asp?., Glu Arg Phe Val 
200 " .. " 205 ; 210 

Ser Lys Ala Gly Lys Ser Asn Tyr Gin Leu Trp His Glu Leu Cys 
215 . 220 225 

Asp Leu He Ser Gin Asn Pro Asp Lys Val Gin Ser Leu Asn Val 
230 235 ^ 240 

Asp Ala lie lie Arg Gly Gly Leu Thr Arg Phe Thr Asp Gin Leu 
245 250 255 

Gly Lys Leu Trp Cys Ser Leu Ala Asp Tyr Tyr He Arg Ser Gly 
260 265 270 

His Phe Glu Lys Ala Arg Asp Val Tyr Glu Glu Ala He Arg Thr 
275 280 285 

Val Met Thr Val Arg Asp Phe Thr Gin Val Phe Asp Ser Tyr Ala 
290 295 300 

Gin Phe Glu Glu Ser Met He Ala Ala Lys Met Glu Thr Ala Ser 
305 310 315 

Glu Leu Gly Arg Glu Glu Glu Asp Asp Val Asp Leu Glu Leu Arg 
320 325 330 

Leu Ala Arg Phe Glu Gin Leu He Ser Arg Arg Pro Leu Leu Leu 
335 340 345 

Asn Ser Val Leu Leu Arg Gin Asn Pro His His Val His Glu Trp 
350 355 360 

His Lys Arg Val Ala Leu His Gin Gly Arg Pro Arg Glu He He 
365 370 375 

Asn Thr Tyr Thr Glu Ala Val Gin Thr Val Asp Pro Phe Lys Ala 
380 385 390 

Thr Gly Lys Pro His Thr Leu Trp Val Ala Phe Ala Lys Phe Tyr 
395 400 405 

Glu Asp Asn Gly Gin Leu Asp Asp Ala Arg Val He Leu Glu Lys 
410 415 420 

Ala Thr Lys Val Asn Phe Lys Gin Val Asp Asp Leu Ala Ser Val 
425 430 435 

Trp Cys Gin Cys Gly Glu Leu Glu Leu Arg His Glu Asn Tyr Asp 
440 445 450 

Glu Ala Leu Arg Leu Leu Arg Lys Ala Thr Ala Leu Pro Ala Arg 
455 460 465 

Arg Ala Glu Tyr Phe Asp Gly Ser Glu Pro Val Gin Asn Arg Val 
470 475 480 

Tyr Lys Ser Leu Lys Val Trp Ser Met Leu Ala Asp Leu Glu Glu 
485 490 495 

Ser Leu Gly Thr Phe Gin Ser Thr Lys Ala Val Tyr Asp Arg He 
500 505 510 

Leu Asp Leu Arg He Ala Thr Pro Gin He Val He Asn Tyr Ala 
515 520 525 

Met Phe Leu Glu Glu His Lys Tyr Phe Glu Glu Ser Phe Lys Ala 
530 535 540 

Tyr Glu Arg Gly He Ser Leu Phe Lys Trp Pro Asn Val Ser Asp 
545 550 555 
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He Trp Ser Thr Tyr Leu Thr Lys Phe He Ala Arg Tyr Gly Gly 
5 60 565 J j 

Arg Lys Leu Glu Arg Ala Arg Asp Leu Phe Glu Gin Ala Leu Hi 
^ _ 575 . 5 8 o 5 *T 

Gly Cys Pro Pro . Lys Tyr, Ala -Lys Thr Leu Tyr Leu f Leu Tyr Ala 

Gin Leu Gltl Glu Glu Trp Gly Leu Al* Arg' His Ala Met Ala Val' 
oOS -610 cic 

Tyr Glu Arg Ala Thr Arg Ala Val Glu Pro Ala Gin' Gin Tyr Asp 
620 • 6 25 • i --^ 
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Met Phe Asn He Tyr lie Lys Arg Ala A £a Glu He Tyr Gly Val 



Thr His Thr Arg Gly H e Tyr Gin Lys 2a lie Glu Val Leu til 

Asp Glu His Ala Ajrg Glu Met Cys Leu Arg Phe Ala Asp Met gIu 
665 670 , 675 

Ala He Tyr Ser 

- CZ"\ \f A 1 a ' i>Via rn- 

69 5 700 



Cys. Lys Leu Gly Glu He Asp "Arg Ala Arg Ala lie Tyr Ser Phe 

6 80 \ 685 c 
Cys Ser Gin He Cys Asp Pro Arg Thr Thr Gly Ala' Phe Trp Gin 

Thr Trp Lys Asp Phe Glu Val Arg His lly Asn Glu Asp Thr He 

Lys Glu Met Leu Arg He Arg Arg Ser Val Gin Ala Thr Tyr Isn 

Thr Gin Val Asn Phe Met Ala Ser Gin Me£ Leu Lys Val Ser Gly 

745 Ten 
Ser Ala Thr Gly Thr Val Ser Asp Leu Ala Pro Gly Gin Ser Gly 

7 55 -» *• n 



760 



710 715 720 

Arg He Arg Arg Ser Val Gin Ala Thr Tyr 

725 7 30 735 

Leu Lys Val Ser 

740 745 750 

Gly 

Met Asp Asp Met Lys Leu Leu Glu Gin Arg Ala Glu Gin Leu 111 
770 775 -7Qn 

Ala Glu Ala Glu Arg Asp Gin Pro Leu Arg Ala Gin Ser Lys I 

785 790 7Q[ . 

Leu Phe Val Arg Ser Asp Ala Ser Arg Glu Glu Leu Ala Glu Leu 

Ala Gin Gin Val Asn Pro Glu Glu lie iln Leu Gly Glu Asp G^u 
815 820 82 

n, n Glu Val Leu Glu 

830 835 840 



Asp Glu Asp Glu Met Asp Leu Glu Pro Asn Glu Val Arg Leu IV 

Gin Gin Ser Val Pro Ala Ala Val Phe Gly Ser Leu Lys Glu 

845 850 * 



855 



<210> 55 

<211> 1598 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ f eature 

<223> Incyte ID No: 116462CB1 

<400> 55 



?^« at ^ ta ? gtcccttact tttactagcc acccccttcc cacttgcttc taataacaaa 60 

illllliii=~slii 

1= ssss ~ EEi i™ Eii OS 

ilH I™ =- ? ~" S=52 =3 
™ » ? -~ ssssss sss 

tggtattccc atatttatgg tgctggtgcc agtgtttgcc ctcaccatgg tagcagcctg 720 
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ggctttcatg agataccggc 
gaagatacat ttcactcacc 
catagccaga tttgaagtga 
tcttcatatt. ctaatgggaa 
ggcttagaac aaatgaggag 
aacccaaaga gagaaagatt 
tggcatgcca taaaggccag 
gatcttcctt gggaaattat 
tatatcgaca aagagtgaga 
cacatacgta tgcaaatatt 
aagctacttt gtggtgtcta 
agagatactt aaacagagca 
ttcagtctca tttattaaaa 
cccagcactt .tggtagcctg 
acatagcaaa acctcatctc 

<2<10> 56 

<211> 1432 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1210462CB1 

<400> 56 

tgagaatgaa agtggatgcc cgcgaatccc ggaagtcaga ctgttttttt cagttccctg 60 
gaggcttttt gatactgatt cgcgtacacc tgttgtttga aagctctcag cgggacaatg 120 
ctgacccagc tgaaagcaaa atcagagggg aagcttgcaa aacagatttg caaagttgtg 180 
ttggatcatt ttgaaaaaca gtattccaaa gaactcggag atgcctggaa tacagtaagg 240 
gagatactaa catctccatc atgctggcaa tatgctgtcc tgcttaaccg attcaattat 300 
ccttttgaac tggaaaagga tttacatttg aagggctatc acacactctc tcagggatct 3 60 
ttacccaact atcctaaatc agtgaagtgt taccttagca gaactccggg ccgaatccct 420 
tcagaaagac accaaattgg aaacctgaaa aaatattatc tcctaaatgc tgcttctctt. 480 
ctcccagtgt tggctctgga attaagggat ggggagaagg ttctggatct ctgtgctgct 540 
cctggaggga aatcaatagc tctgctgcag tgtgcctgtc caggttatct tcattgtaat 600 
gaatatgata gtctgagatt gaggtggcta aggcagacgt tggaatcttt, catcccacag 660 
cctttgataa atgtaattaa agtgtctgaa ttggatggca gaaaaatggg agatgcacag 720 
cctgaaatgt ttgacaaggt gttagtggat gctccgtgtt caaatgatcg aagctggttg 780 
ttttcttctg actctcagaa ggcatcctgt aggataagtc aaaggaggaa tttgcctctt 840 
ctacagatag agctgttaag gtctgcaatt aaggccttac gtcctggagg gatacttgta 900 
tactctacat gcacgctttc caaggcagaa aatcaagatg tgatcagtga aattttaaac 960 
tcccacggta acatcatgcc tatggacatt aaaggaatag caaggacttg ctcccacgac 1020 
ttcacatttg ctcccactgg ccaggaatgt gggctcttag tgattccaga taagggcaaa 1080 
gcctggggcc caatgtatgt agccaaattg aagaaatcat ggagcacagg aaaatggcga 1140 
catgaatttg taaactgtgt ttatgtgtta ttatatttat atttctgaac ccagtacatg 1200 
ttaatattta aataattatg cagtaacttt ctctgggtct gtttggaatc ctattcagtt 1260 
aatactttag catcttagaa tctaggcttg agaattgttc aggcgtattt ttttcctaga 1320 
aatatatctg taacaatgat ttaaggtggt gcagatggtg tttgttctat attataaatc 1380 
tgctgtcttt gcttggcatt ttatagttaa attaattaga atatgtggtt tt 1432 

<210> 57 

<211> 2317 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1305252CB1 

<400> 57 

gcgggtgctg ctagcggagg cgccatattg gaggggacaa aactccggcg acacgagtga 60 
cacaaataaa cccctggacc cccttgttcc ctcagctcta agggccgcga tgttgtacct 120 



aacaactttg aaaaacttgc 
ctccacccct gctatcctgc 
ctgataccca ccccaaacct 
agtaaaggta ttgtttgaag 
ttatatattt tactaggact 
gagtcttcct gtcaccatag 
ctatgtgata ttagaggaag 
tgtggccttt . atttaatttc 
agagcatttt tactttttta 
atagtataat agtgatccct 
ggtttctgat aaaagggatg 
aatgtagtag gaacaaggga 
taataactga gactgggaga 
aagtgggaga ttgcttgagt 
tatttaaaaa aaaaaaaa 



tctctttcaa tactcccaat 780 
catgtctttc cctctctctg 840 
tgctgttcac agt.ctccaat 900 
gaaaactgaa gaaaagactt 960 
tttgatagaa attcagctac 1020 
gcaatacctt ttttcttagc 1080 
aaaggatttt tctttttaat 1140 
taactacgta cctgggtgcc 1200 
aaaaagcaaa tacatatata 1260 
atggagaatt ^aaaggtgaga 132 0 
atcttaactg aagaatttaa 1380 
gtgagcctta. taagaggacg 1440 
ggtggctcat gcctgtaaat 1500 
ccaggagacc agcctgggca 1560 

1598 
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agaagactat ctggaaatga ttgagcagct tcctatggat ctgcgggace gcttcacgga 180 
aatgcgcgag atggacctgc aggtgcagaa tgcaatggat caactagaac aaagagtcag 240 
tgaattcttt atgaatgcaa agaaaaataa acctgagtgg agggaagagc aaatggcatc 3Q0 
catcaaaaaa gactactata aagctttgga agatgca^fat gagaaggttc agttggcaaa 360. 
ccagatatat ga*dttggtag' atcgacactt gagaaagctg gatcaggaac tggctaagtt 420 * 
taaaatggag ctggaagcfcg ataatgctgg, aattacagaa atattagaga ggcgatcttt 480 ' 
ggaattagac actccttcac agccagtgaa caatcaccat gctcattcac atactccagt S40 • y 
ggaaaaaagg- aaatataatc caacttctca ccatacgaca acagatcata ttcctgaaa,a 600 
gaaatttaaa tctgaagctc ttctatccacf ccttacgtca gatgcctcta aggaaaatac/ 660 
actaggttgt cgaaataata attocacagc ctcttctaac aatgcctaca atgtgaattc* 72 0 
ctcccaacct-ctgggatcct ataacattgg .ctcgttatct tcaggaactg gtjgcaggggc 780 
aattaccatg gcagctgctc .aagcagt- tea ggctacagct cagatgaagg agggacgaag 840 " 
aacatcaagt ttaaaageca gttatgaagc atttaagaat' aatgactttc agtfcgggaaa 900 " 
agaattttca atggccaggg aaacagttgg ctattcatca tcttcggcac ttatgacaac 960 
attaacacagsaatgccagtt catcagcagc egaftcaegg agtggtcgaa ^agagedaaaa 1020 
caacaacaag tcttcaagcc agcagtcatc atcttcctcc tcctcttctt ccttat:catc 1080 
gtgetcttea tcatcaactg ttgtacaaga (aatctctcaa caaacaactg tagtgecaga 1140. 
atctgattca aatagtcagg ttgattggac ' ttacgaccca aatgaacctc gatactgeat 1200 
ttgtaatcag gtatcttatg' gtgagatggt gggatgtgat aaccaagatt gecctataga 12 60 4 
atggttccat tatggctgcg ttggattgac agaggcacca aaaggcaaat ggtactgtcc 1320 
acagtgeact getgeaatga agagaagagg cagcagacac aaataaaggt ggtccttttfc 13 80 
tttgatgaag aaataaactt cagctgaaga ttttatatag gactttaaaa agaagagaag 1440 
agaaagaaga aacaatgeat ttccaggcaa ccacttaaag gatttacata gacaatccta 1500 
taagatcttg aacttgaatt ttatgggttg tattttaata atgtaagtaa attatttatg 1560 
cactcctggt gtgctatgaa tattattcca gttagccttg gattatttca gtggccaaca 1620 
tatgeagaca tttgtactcc tcaaccattt tctcaaagta atgggcattc tatgatttag 1680 
acttcaagga attccaatga tgaagatttt aaggaaagta ttttatattc aacaggtata 1740 
ttctgetgea tgtactgtac tccagagctg ttatgtaaca ctgtataCaa atggttgcaa 1800 
aaaaaaaaaa aaagtcagtg cttctaaaaa gaatttaaga taatggtttt taaaatgect 1860 
ttataataag ctttgtttct ttgtgaaact aattcagcag gctgaaggaa atggttcatg 1920 
tgataatgtg ggctggtatc ctctagagta cctgggtaca taaacagaaa ctcctgtagg 1980 
taaaaagtaa tttgtgccat tagtctttct atgtttctgc atccagatag agtgcagttc 2040 
atgacgegag gggccggggg actgaaaggg gaaagggcgt taaagtgata catttttata 2100 
cccaaatgtg tttatttttt tttggtgcca agtaaaccct ttaaaaattg gccaattgta 2160 
attaaggggg gttaaaaata aaaggttttt ttaaaaaaaa tttaaaacaa aacaaaagaa 2220 
gaaaaaaaaa aaggggcegg gcccccccga tcttaagttg aagcctcccg ttggaacccc 22 80 
gcgggggaat tttaaatttt ccggggaccc egggtta 2 317 

<210> 58 

<211> 1774 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1416289CB1 

<400> 58 

tgttaagcta aaagttttga gcatgttctg gcaagattaa tctgatttta ttttaactat 60 
aagtcctact taataaattg taaatacatg gcatataatg taataattat atattttaat 120 
ttcaggtgcc ttgaatggct tctaaacaat ttgatgactc accagaatgt tgaacttttt 180 
aaagaactca gtataaatgt catgaaacag ctcattggtt catctaactt atttgtgatg 240 
caagtggaga tggatatata cactgctcta aaaaagtgga tgttccttca acttgtgcct 300 
tcttggaatg gatctttaaa acagcttttg acagaaacag atgtctggtt ttctaaacag 360 
aggaaagatt ttgaaggtat ggectttett gaaactgaac aaggaaaacc atttgtgtca 420 
gtattcagac atttaaggtt acaatatatt atcagtgatc tggcttctgc aagaattatt 480 
gaacaagatg ctgtagtacc ttcagaatgg ctctcttctg tgtataaaca gcagtggttt 540 
getatgetge gggcagaaca ggacagtgag gtggggcctc aagaaatcaa taaagaagaa 600 
ctagagggaa acagcatgag gtgtggtaga aagcttgeca aagatggtga atactgetgg 660 
cgttggacag gttttaactt cggcttcgac ctacttgtaa cttacaccaa tcgatacatc 720 
attttcaaac gcaatacact gaatcageca tgtageggat ctgtcagttt acagcctcga 780 
aggagcatag catttagatt acgtttggct tcttttgata gtagtggaaa actaatatgt 840 
agtagaacaa ctggctatca aatacttaca cttgaaaagg atcaggaaca agtggtgatg 900 
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aacttggaca 
tcaccagaaa 
tcatcagttg 
ctactgatat 
aatacagaag 
aagccaagaa 
tttagcatca 
atttaatttt 
tgtaaatttc 
^aaatgccttt 
tgattcctcc 
tttcttcctg 
tgctattttt 
ccttcaaatg 
acatggtctt, 



gcaggcttct 
aaaagaattg 
gaaacagtag 
tcacatcgaa 
attattctta 
aatatctgtc 
atagaaaatt 
tagcttaaac 
tttttatttg 
ttcagttggt 
ttttagtcta 
gttttggaga 
tttaatttga 
atcttgaatg 
ttctttgaca 



gatcttccct 
aaaataatcg 
cactttgaaa 
ggtgactaac 
tcctcat.tgc 
aaaccatttc 
gctgtaggta 
tttgtttcta 
gctaaaataa 
gtattgaaag 
atatctttcc 
ctaagctgat 
ctttcctagg 
agggaaaaat 
gtctgtacac 



ttatatatct 
tcacccagaa 
actttttagg 
aatgacaaag 
atttctatgc 
tgttagaacg 
aatctcacat 
ccttatgtta 
tctaaaagaa 
ctttccttta 
aggtcatact 
aaactttttt 
agtttaagaa 
cagtttgatt 
cttt 



gctgtaactt 
aatccagaaa 
ccagct.ttaa 
gccttatgaa 
atatgcgtaa 
atgtcaattc 
ttatctgcaa 
gtggacctca 
taatttggtt 
acattttcac 
tgtttttaat 
taaaacttaa 
cagtgaaagt 
ccaaggatat 



cttgtatata 
actgaagatt 
tttaatggcc 
ctgtacagac 
gaacatttta 
atgcttttaa 
caaaatatag 
gttatccatc 
ggccaattag 
ctgctcattg 
cattaaatat 
gcattgtcat 
tagcttcgca 
ttcttgcctt. 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

.1440 

1500 

1560 

1620 

1680 

1740 

1774 



<210> 59 * y- 

<<211> 1268 

<212> DNA 

<213> Hpmo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1558289CB1 



<400> 59 

taagtgaagc 

agcacagcag 

aagagcctgg 

gctcatctct 

aaggagtaca 

gccatcaagg 

gctacccgct 

gtagacttca 

ctgctagagt 

tttgatcact 

caccttggca 

ccctgagcct 

gcttcctaac 

caagagtgct 

tgctgagggg 

aaaaggtgaa 

gccccctccc 

taaagacaga 

ctgttccaga 

actactaaaa 

agggaaaaga 

ggggccgc 



ttctccattc 
tgactgacca 
cactgcaagc 
tcatagatga 
cgcacagccg 
tggctgtgct 
ttcgccagaa 
ccttcgaggc 
tggtggaaca 
tctctgaccc 
agatctgtga 
agcacattcc 
cctgctcacc 
tttgactctg 
tgaggcctct 
gtgaaacttg 
atgtgaacca 
agaggggaag 
gtgcaggaag 
tatgagaaaa 
gaacgaggag 



tgtaagcttt 
cataccccac 
agagaagaag 
gacaagcagt 
gccccaggcc 
gcaccgcaat 
gctgcggcag 
tgctgttctg 
ccacctcacg 
aggtctgctc 
cggactcagg 
accttgacaa 
tggcactaac 
agaccagccc 
ctcccactcc 
gatctctatt 
aggaaaggag 
gtggccctta 
aaggggctag 
aaaaaggacc 
gggagccggc 



ccgggaacat 
tctccaggac 
ctactgagta 
gaggtgctag 
cagcgcgtga 
ggctcctttg 
ggtgccatga 
gctggcctgc 
cccaagtcac 
acggccctct 
aagctgctag 
aatggttgac 
acttttcaat 
acccccaaac 
agccccagga 
tctcccataa 
gcacagccca 
gagacagagc 
ggcaggggag 
cagcgaaacc 
cccaagacac 



ccaaggcaag 
ccatggagtc 
agatggcggg 
atgagctcta 
tcaaggacct 
gccccagtga 
cggcacttag 
tgaccgagtg 
atggccgcat 
atgggcctga 
acgaagggaa 
tgagaaaaca 
cttcaggctt 
agctagtgga 
caggaaacag 
gggacttctg 
gagaacccct 
ttggacagat 
attctcatag 
ccaaggaagc 
aaagcagcaa 



actggcaccc 
cttcagctca 
tcgctctgtg 
ccgtgtgtcc 
gatcaaagtg 
gctggccctg 
ctttggtgag 
ccgggatgtg 
ccgccacgtg 
cttcactcag 
gctctgagag 
cagataatgg 
cattccttcc 
gaaggagcaa 
aactgcctga 
aaacagggaa 
ttggggatac 
gccagaggct 
gggaaataaa 
gcaacaggca 
aaaaagaggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1268 



<210> 60 
<211> 1331 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1577739CB1 



<400> 60 

gacatcttga ggcccacgtg gacctgagtt 
cagccacccc gtggagagcc tggcgctgac 
gctggtgaag gccgagatgg aaaagttttt 
gaaggggaag atttgctgct gctgccgggc 



ctggtgcagg atttaatcaa ctgaggagtt 60 
tgtggaagag gtgatggacg tgcgccgtgt 120 
gcagaacaag gagctcttca gcagtctgaa 180 
caagttcccg ctgttctcgt ggccgcccag 240 
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ctgtctcttc 

taagaaattt 

agctgccaaa 

gtggcagagc 

tgtctgcagt 

ggacgtcctc 

caccaggact 

gcaggccctg 

atacatttat 

agctcatttg 

gattccttgc 

gcagaaggtt 

ccaggaatgc 

ttggctccca 

ggcagcagag 

cctgtttgaa* 

ggccgaggag - 

aaatgttgtc 

cccagctaca 



tgcaagagag 
ggacacatcc 
accgcgccaa 
gtggaggagg 
gagtgcacpa 
aacactacgc 
cttgacttca 
tatcaggcta 
aatatataca 
ggttcaggcg 
cagggaggaa 
ctagatcc,tg 
gaacggcagt 
gctggaccag 
gcatctggga 
aggtctgg<jc 
ggaggatcac 
tctactgaaa 

9 ; ■ ' 



<210> 61 
<211> 3227 
<212> DNA 

<213> Homo sapiens 

<220> S 
<221> misc_f eature 
<223> Incyte ID No: 



ccgtctgcac 
ctgtctacac 
fccagagaag 
cgttccccca 
gct.tt:gtggc 
cacgacgcag 
agtgacagcc 
ggacgctctg 
cacagtctat 
cacttcaaaa 
ggggagggaa 
gcacagactg 
ttcccttccc 
agtgccctgc 
agtctctgag 
aggcgtggtg 
ctgaggtcag 
atgcaaaaat 



ttcctgtagc 
actgggcttt 
agacatcttt 
catctactcc 
agacgtggtg 
tcgceagacc 
r ccaggtggcc 
agctgtgcat 
atatttatat 
ccctccctgg 
cagggtgggt 
catcccatgt 
cagtggacgt 
ttgcctctgc 
taggcagggt v 
gttcaggcct 
gagtttgaga 
tagccaggta 



ataaagatga 
gagagtcctc 
cagtctctgc 
cacggctgtg 
•cgttccagcc 
caatccctct 
aggcctccag 
gtacatatat 
acactgtttic 
gggaggctgt 
tttctcactg' 
tcccatgctc 
ctaggtgggg 
tctccctttg 
cctcctggga 
gtaattccag 
'ccagcctggc' 
tagtggcagg 



agatgccttc 

agagggtatc 

aagggccaca 

tcctgaagga 

gcaagagcgt 

acatccctaa 

gaggcaccag 

acatatatag 

ctggccccag, 

ttcttctcag 

aagagagaaa 

ttctccgtcc 

acagggtatc 

tggggactca 

ggcaccccca 
cactttggga 
caacatgatg 
aacctgtaat 



300 

360 

420 | 

48Q 

540 

600 

660 

720 

780 

8'40 

900 

9$0 

1020 

1080 

1140 i 

1200 t 

1260 

1320 * t 

1331 



1752768CB1 



<400> 61 

tgcgagtacc tccatggtcc cggtggctgt 
aaacagcgat ttctcagatt tgcgggaaat 
tacccgggag cggggcctac tacacagtag 
ccctgcattg cctctggccg agctgcaacc 
ggatatggat gcctataccc tggccaaggc 
agcacatttc ctgcatggct gcaatagcaa 
atatctgtct ggagaaaaaa agaaggacga 
aaaaggacaa gtgaaaaatg aggcgcttag 
ccaagctcga gaacttgatg gatttggact 
ggacttggtt aaagaggcca ttgatgtgtt 
ttggggagcc tggttagaac tctgtaacct 
gtctttgcca gacacctgga tgaaagagtt 
gttgatagag gaggccctgc aaaagtatca 
ctcgtatatt gtttcccaaa ttgcagttgc 
cctctccatt tttaatgagc taaggaaaca 
attctccaac cttctttatg tcaggagcat 
cctctgtgag attgataaat atcgtgtaga 
tttacgttct cagcatgaga aagcagcctt 
tcggtatctt ggtgcctgga cactaatggg 
tgctgctatc caggcttata gacatgccat 
gtatggcctc gggcagacct atgaaatcct 
tagacgggcc caccagcttc gacccaatga 
ttacgagaaa ctcaatcaac tagtggaagc 
gggagatgtg gagaaaatgg ctctggtgaa 
gtcagaacag gctgcccagt gttacatcaa 
aatagtagaa cacttggagg aaagcactgc 
gtgcaaactg tgggatgaag cttcaacttg 
ccgggaagaa ggtaaggcct tactccggca 
tcctaccacc gaggtgcctg ctcccttttt 
ccccacacgc agagtttctc cactcaactt 
caagccagca cattgttaga cccatcttaa 
cgtctgttcc aaggacctca gctcttcttg 
cagcttgtat aattaccttc agaggccaac 



gacggcggca gtggcgcctg tcctgtccat 60 
taaaaagcaa ctgctgctta ttgcgggcct 120 
caaatggtcg gcggagttgg ctttctctct 180 
gcctccgcct attacagagg aagatgccca 240 
ctactttgac gttaaagagt atgatcgggc 3 00 
gaaagcctat tttctgtata tgtattccag 3 60 
tgaaacagtt gatagcttag gccccctgga 420 
agaattgaga gtggagctca gcaaaaaaca 480 
ttatctgtat ggtgtggtgc ttcgaaaact 540 
tgtggaagct actcatgttt tgcccttgca 600 
gatcacagac aaagagatgc tgaagttcct 66 0 
ttttctggct catatataca cagagttgca 720 
gaatctcatt gatgtgggct tctctaagag 780 
ctatcacaat atcagagata ttgacaaagc 84 0 
agacccttac aggattgaaa atatggacac 900 
gaaatcggag ttgagttatc tggctcataa 960 
aacgtgctgt gtaattggca attattacag 1020 
atatttccag agagccctga aattaaatcc 1080 
acatgagtac atggagatga agaacacgtc 1140 
tgaggtcaac aaacgggact acagagcttg 1200 
taagatgcca ttttactgcc tttattattg 1260 
ttctcgcatg ctggttgctt taggagaatg 1320 
caaaaagtgt tattggagag cttacgccgt 13 80 
actggcaaag cttcatgaac agttgactga 1440 
atatatccaa gatatctatt cctgtgggga 1500 
ctttcgctat ctggcccagt actattttaa 1560 
tgcacaaaag tgttgtgcat ttaatgatac 1620 
aatcctacag cttcggaacc aaggcgagac 1680 
cctacctgct tcactctctg ctaacaatac 1740 
gtcttctgtc acgccatagt tggctactct 1800 
ttaagcctta cctccatgta aagaacagca 1860 
tttctacaga tggcaacagc tccataggga 1920 
tgacagaatc ctggcaggaa cagacattat 1980 
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cttgccagtt agaagtactt ctgtctcact 
cttbttccct gaatgctttt ttttttttgg 
agcacaggca atgctacagg aacaaagttt 
aagtctgcca ctctaccctg agctggcagt 
agcaatggat acaaatgacc ttgctcttgg 
cagagattag gtttaaatgg aatagagcta 
tcttccaaaa cagttttggg ggggtcttct 
aactagagtt gaataaaaga agggcaaaaa 
cccttactga gggcaaaaga tggctatatt 
gagataggat tatagataat ttcacggaca 
aagaataaac tcacttcaag ctggtctgtt 
ttttccctcc catctgcaac agaatacatt 
ggaatcattc ttaUggtgct cttggagagt 
ggtaatatgc tataatgtct gctgtaatat 
gagctcaggg tccagacttt ;attcttattc 
aacatatgac acttcatctt atattaagga 
catttgttac taagccttct caggcaagct 
ccttttcaag gttcccatgg tccagaatga 
ataatttgtt ttaatgattt tgctacattt 
tctttaatat aactgttttt gtgtgcatag 
aacatgagat tacataggtg gttataatac 

<210> 62 
<211> 1865 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1887228CB1 

<400> 62 

gttctagatc gcgagcggcc gcctcccgga 
tcagaattgg gcccaggagt gaaggccaag 
ctcttgaact gggtggcatt gaaacccagc 
aatgatgaga aggtgctgca ggagctagac 
aagtcccaag gccccagcct ggaccccagc 
cccagcaagg cgacactcat tgaggccaac 
aagggcaacc tgggggccga gcgcatctgc 
ctgggcctgg acttcctgga gctgctgatg 
ctcatcaccc gctttgagcg ggagcagcgg 
ttcatgctat gcttca'gccg catcccgcgc 
ctgggcaact tcccggacac agcccagctg 
gcctcaatgt ccatcaagtc ctctgacaaa 
tttggcaact acatgaacag tagcaagcgt 
ctggatgcgc tgttggagat gaagtcgact 
gtgaaggtca ttgctgagaa gtacccgcaa 
ctggacaagg cgggctcagt gtccctggac 
cgaggcctag agttgacaca gagagagttt 
gagttcctga gggccaactc gcccaccatg 
caggaggcct ttgagtctgt ggtggagtac 
ggcctgttct tctccctctt tagccgcttc 
gtggaacagt ggaaaaaaga agccgctgcc 
ggggagcccc cagcacccaa gtcaccgcca 
tctgagctga aacggaggca gcagaaggag 
gccattgaag acatcatcac agatctgcgg 
cgccgcagtg cccgtcggcg tcccccgggc 
ctgtagccgc tatttctgca ggtggattct 
gctcaaggaa ggtggtcctc agctcggctg 
cctcaaacgg gctggtgcat cctcctcttg 
ccccaaatgc tgcttgcagc acccacccta 
cagcaggagc ctggcctgta acttataaag 
aggaccgtcc atggacctta tttttatatg 
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tatgtccaaa gagtggctat agatcttggc 2 040 
cccccaagaa agtccctJbtt atagcacttc 2100 
caatgctgct gagagtgaaa gaaaggagga 2160 
agggcactga gtaccctagg aagaagtcag 2220 
atttgctgag. catgatccct attctgatgt 2280 , 
tccat'ttgtt cttactctct agggagacaa 2340 • 
aaagctttca' aattggaagt aactttattc 2400 
taatctcaca gagcttggaa ctgctgatag 2460 
gttagctata ctcctaccaa agcaagcaag 2520 
tttggaaata acattggtga ttatacagac 2580 
ttaataaatt ttcaacgtaa ttgtctaCtt 2 640 
tttttcagcc tttatctaga tgaggtaaag .2700 
ctcaggcctg tgcatgtgtg tacagcagga 2760 
atttgcacag tagatgctat ggatcattct 2820 
ccagaatttt gtgttacgtt tttacctcct 2880; 
aggtttagaa tatctaatac gacttgaatt 2940 
gtatactagt tactggcctc cactgccatg 3000 
tgtttgattc ttaatfetttc tgtccctttt 3060, 
ggaattcaat aaaaaatgtg aacaataata 3120 
aaatcatata agtaaataaa aaaaaacaac 3180 
aaaagtgaga aaaaagc" - 3227 

% 



ggtcctcctg atgccctagg aagacgcgac 60 
aagcccatcc agactaagtt ccgaatgcca 120 
cagatcaccg gcactgtctt cacagagctc 180 
atgagtgatt ttgaggaaca gttcaagacc 240 
gctctcaaga gtaaggcagc ccagaaggcc 300 
cgggccaaga acttggccat caccctgcgg 360 
caagccattg aggcgtacga cctgcaggct 420 
cgcttcctgc ccacagagta tgagcgcagc 480 
ccaatggagg agctgtcaga ggaggaccgc 540 
ctgccggagc gcatgaccac actcaccttc 600 
ctcatgccgc aactgaatgc catcattgca 660 
ctccgccaga tcctggagat tgtcctggcc 720 
ggggcagcct atggcttccg gctccagagc 780 
gatcgcaagc agacgctgct gcactacctg 840 
ctcacaggct tccacagcga cctgcacttc 900 
agtgtcctgg cggacgtgcg ctccctgcag 960 
gtgcggcagg atgactgcat ggtgctcaag 1020 
gacaagctgc tggcagacag caagacggct 1080 
ttcggagaga accccaagac cacatcccca 1140 
attaaggcct acaagaaagc tgagcaggag 1200 
caggaggcag gcgctgatac cccgggcaaa 1260 
aaggcccggc ggccacagat ggacctcatc 13 2 0 
ccactcattt atgagagcga ccgtgatggg 1380 
aaccagccct acatccgcgc agacacaggc 1440 
cccccactgc aggtcacctc cgacctctcg 1500 
gcaggggtgt ggggccgtgg acaggctgag 1560 
gccgggcagc ccctcctccg ctgtggcccg 1620 
gccacagagg gcagcatcgc ccgccccttc 1680 
aagccccctc caaatagcca tacttagcct 1740 
tgcacctcgc ccccgcaagc cccagccccg 1800 
agattaataa agatgtttgc aaaaaaaaaa 18 60 
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aaaaa 

<210> 63 4 

<211> 1924 

<212> DNA 

<213> Honio sapiens 

<2 2 0> 

<221> mrsc_f eature 
<223> Incyte ~XD No: 



1865 



1988468CB1 



<400>. 63 
agctgccggg 
tccgaatctg 
tgcacgccag 
ccctcggtct 
atgrgccatga* 1 
cadcgcatgg 
cacgccttca 
acgagcggca 
gcgctggccc 
cagggaggct 
catcacccct 
atgaacggga 
acccccggcg 
ggcgcgggca 
gcccacgtcc 
gaagttctta 
ctctggctgg 
agcagagtga 
ttcggcgtga 
aaggtttgag 
aattgctgcc 
cctcccttat 
cctttccctc 
tcttgtggat 
ggaaaaattg 
ccatctcgct 
aaactggaag 
cactttcaag 
tagggtgtgg 
caacattttt 
gaaagctttt 
ttgttaattt 
aaaa 



cggtcctgcc 
ccccagcggc 
gaaggtcccc 
jtcggagcaga 
* accacgg^cg 
gcatggggca 
acgccctaat 
tcaggcatgc 
ccgcggccag 
ccctgccggc 
acccccacaa 
caaaccagca 
gctcgggcgg 
gcagcaacag 
ccgctgcaat 
tgtccttggt 
ggcaaaacga 
gctgttgact 
attaaaagaa 
aacttggaaa 
actttttttt 
gtagttgaaa 
actttctcca 
tttgtagatg 
cattagttgg 
tcattttttt 
attaaacaga 
ttggctgtcc 
aatgtcgtca 
gacactgtgc 
taggaaattt 
attctatttt 



gagctgtgag 

cgctgcgaga 

tctatgtgct 

aatcgcaaaa 

cttccccgac 

gttcccgagc 

gggcgagcac 

gatggggccg 

gtttaacaac 

cagcatgcag 

ccactacatg 

cttccgagat 

cagcagcacc 

cggcggcggc 

gctgccgccc 

gatagaaatg 

gtttgatttt 

cgatcgaaac 

acattccctt 

caaagtaaac 

cctgtttttg 

tatctagcta 

gtgctcaact 

gtttcaaatg 

ttgcatgaac 

aactatgcat 

tgtgggccaa 

cccccgccgc 

gtggcaaaca 

taatagttat 

tgacagtatt 

aatttgccaa 



ggcaacfirgag 
cctcggcgcc 
,gctgagccgg 
acggaaggac 
^ggcaccaatq 
ccccatcacc 
atacactacg 
gggactgtga 
tcccagttca 
ctgcagaagc 
ccggatttgc 
tgcaacccca 
cccggcggct 
agcggcagcg 
aatgtcatag 
ggtttggacc 
atgacggact 
cccggcgaaa 
agacacagta 
tataaacttg 
tttcgttttt 
acttggtctt 
gttagatatt 
actgaactgc 
ttcgaagggc 
ttgagtacag 
actgttctgg 
cccccccacc 
tttcacagat 
attcagtaca 
tttgtacaaa 
tgtcaataaa 



gggaaataaa 
gacatcgcga 
tcctggacgc 
tggaaatggc 
ggctgcacca 
accagcagca 
g^cgcgggcaa 
acggagggca 
tgggtccccc 
tcaacaacca 
accctgctgc 
agcacagcgg 
ctggcagcag 
gcaacatgcc 
acactgattt 
gcatcaagga 
tcgtgtgcaa 
gaaatcaaac 
tctcactttt 
tacaaattgg 
gtagccttga 
tttcgttgtt 
aatcttggca 
attcagattt 
agatattact 
actaattttt 
atcaggaaag 
cccatatgta 
ttttattttg 
tgaaaagata 
acattttttt 
aagttaagaa 



agggaacggc 
^cagagcgctt 
gacgagcccg 
^agaccatatg 
tcaccctgcc 
gcagccccag 
-catgaatgcc 
ccccccgagc 
ggtggccagc 
gtatttcaac 
aggccaccag 
cggcagcagc 
ctcgggcggc 
cgcctccgtg 
catcgacgag 
gctgcccgaa 
acagcagccc 
ccccaacttc 
cagatcttga 
ttttaaaaaa 
cattcaccca 
tgtttttact 
aactgcttaa 
acgagtgaaa 
gcacaaactg 
aaaatatgct 
tcatactgtt 
cagatgataa 
tttctgtctt 
ctactgtgtt 
gaaaaaatac 
ataaaaaaaa 



60 

120 

180 

240 

3p0 

360 

42 0 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1924 



<210> 64 

<211> 948 

<212> DNA 

< 2 1 3 > Homo s ap i ens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2049176CB1 



<400> 64 

ggcctggtcc 

agatggtgtc 

catattattc 

t.gtactggat 

cttggtttcc 

ataccaaagg 

aaagggagat 



gcagcgccct 
ctggatgatc 
atacaaagct 
tgtttttgct 
cctgtactat 
agcaagtcta 
tgatgattat 



gcgcccaccc 
tccagagccg 
gtgaaaacaa 
ctctatactg 
gagctgaaga 
atatatagaa 
attgtacaag 



gccccggacg 
tggtgctggt 
aaaacgtgaa 
tgattgaaac 
ttgcttttgt 
aattccttca 
caaaggaacg 



tggggcccaa 
gtttggaatg 
ggaatatgtt 
agtagccgat 
catatggctg 
tccacttctt 
aggctatgaa 



gcccccgtga 60 
ctttatcctg 120 
cgatggatga 180 
caaacagttg 240 
ctttctccct 300 
tcttcaaagg 360 
accatggtaa 420 
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actttggacg 

gccaaggagc 

aaggtgatga 

gtaaaccagc 

aaacagatga 

ttcgaagatc 

acgggtcact 

gtcaaatatc t 

ttcaggattt 



gcaaggttta 
aataactgaa 
gcctgtggga 
ccccagtgaa 
agaagcagag 
gcaaagcatg 
aaaatacaaa 
ccaagacaga 
acacattaaa 



aaccttgcag 
cgtttaagaa 
caaagaccat 
tcagcaggtt 
gggccatatt 
aaatctgtga 
gtgaagaaac 
ttatgctaaa 
atgattattt 



ctactgctgc 
gcttcagtat 
accaacctct 
atggaattcc 
cagataatga 
aaaccaccaa 
gaccacaagt 
tacatcgact 
aaattgtggc 



tgttactgca 
gcatgattta 
accagaagca 
actgaaagac 
gatgttaaca 
aggccgcaaa 
■gtatttttag 
'tcatcttcta 
agtgatgg 



gcagtaaaga 
acaactatcc 
aaaaagaaaa 
ggagatgaga 
cacaaagggc 
gaggtgcggt 
tcatctacac 
acatgatata 



480 
540 
600 
660 
720 
780 
840 
900 
948 



<210> 65 

: ,<211> 2035 " 

<212> QNA . 

<213> Homo sapiens 

<220> 

<22-l> misc__f eature 

<223> Incyte ID No: 



2686765CB1 



<400> 65 

gaccgtggcc" 

aggagtgact 

aacatcagca 

tgggacttgt 

aaactctcaa 

gcaggccccg 

gtccttggcg 

gcgcaagacg 

ctccattccc 

ctgggataag 

cccctcctct 

gcagaaccca 

tgtgattgag 

ggggcatggg 

cgagggttac 

ggagaagcct 

tgcagggctg 

ctccccgctg 

ggagtggttc 

ggacatgttt 

ccacacgctg 

gcagctgcgt 

caatctgccg 

gcgggaaacg 

catccacgag 

cctccctggc 

ccccgtccgc 

gcatccatcc 

ccggaagcag 

gcgggagaag 

gaagcaggag 

gcgtcgaaag 

ggcccggcag 

tttccctcag 



ctgaccgcca 

gcacggagtg 

tcagcaggac 

gaggctccgg 

gacctgggct 

gacacgctga 

gatgatgtga 

gacaagtatg 

gtggacgtgg 

tggctgtcac 

ctcagagcca 

ggaaagtttg 

aaggacctgc 

caacaggacc 

tgccaggccc 

tttggtgcct 

gaggccattc 

gcgcaccgcc 

atgtgcatct 

ttctgtgaag 

ggctcagtgg 

aacctgcccc 

gtgacagaag 

cggggggagc 

gagcgccggc 

ctcaagagcc 

agagccagtg 

cttccctcac 

gagaaggagc 

gagcggcaga 

aaggagcggc 

gcagatgggc 

gacgcttact 

ccaagaacag 



aacccccgct 

atgcaggaca 

aggctcctga 

tagctgtcgt 

ccacgtccag 

gttacttgga 

gctccatggg 

gcttccttgg 

ctcggcagcg 

ggcgattcca 

aagcctggca 

aggagctgga 

accgccagtt 

tgtaccgaat 

aggcccccgt 

gggtgcagat 

agctggacgg 

acctgcagcg 

tcgcccgcac 

gcgttaagat 

agaagctgcg 

agcagtgcat 

cactgattga 

tgcagtatcg 

ggcaacagcc 

gaggctcccg 

ctgggcctgc 

ccactggcaa 

ggcagaaaca 

agcaggagaa 

agaagcagga 

ccccaggccc 

tctgacctct 

gcctggccca 



tgcccccaag 

agtgacaggt 

ggacccctca 

gaccgtgacc 

cctgggacct 

ctccgtgagc 

ctcagattca 

gggcagccag 

ggagctcaaa 

gaaggtgaag 

gtacctgtct 

acgggctcct 

ccctttccac 

cctgaaggcc 

ggctgcggtc 

ctgcgacaag 

ggagatcttt 

gcagcgcatt 

cctgccctgg 

catcttccgg 

ctcctgccaa 

gcaggaagac 

gcgggagaat 

gccctcacgg 

acccctgggc 

ggcagctgga 

cccagggcct 

tagcaccccc 

ggagaaggag 

agagcgagag 

gaagaaggct 

ccatgatggt 

gccctggggc 

aggtgccacc 



ccgggaacca 

gggcatggag 

ggccctggca 

ccagctccgg 

ggcatctctg 

ctcatgtctg 

gagataaacg 

tactcgggca 

tggctggaca 

ctgcgctgcc 

aatagcaagg 

ggggacccca 

gagatgtttg 

tacaccatct 

ctgctcatgc 

tacctcccag 

tttgcactcc 

gaccctgtgc 

gcgtcggtgc 

gtggccctgg 

ggcatgtatg 

ttcctggtgc 

gcagcccagc 

cgactgcatg 

ccctcctcca 

ggggccccgt 

gtggtcactg 

ttgggttcca 

cggcagaaac 

aagcaggaaa 

caaggccgga 

ggggacaggc 

tggac tgcat 

ccctagcacc 



cagtggcctc 

ctgccgcagc 

caggcccctc 

agcctgctga 

ggcctcgagg 

ggaccttgga 

ggctggccct 

gcctagagag 

tgttcagtaa 

ggaaggggat 

aacttctgga 

agtggctgga 

ctgctcgagg 

accggcctga 

acatgcctgc 

gttactacag 

tgcgccgggc 

tctacatgac 

tgcgtgtctg 

tcctgctgcg 

agaccatgga 

atgaggtgac 

tcaagaagtg 

ggtcccgggc 

gcctcctcag 

ccccgccgcc 

ctgagggact 

gcaaggagac 

aggagaagga 

aggagcgaga 

agctttcgct 

cctcagccga 

ggcccccctc 

ttgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2035 



<210> 66 

<211> 766 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 3215187CB1 
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<400> 66 - 
cttggaggga 
tggtgggatt 
tctctggtgc 
gagttaagga 
cggttgagga 
tccatccata 
-ctgaatgtcc 
. tcagaactag 
aaggaggcet, 
atgatetaca 
tgggagaggc 
gaggatacgt 
agcagctggg 

<210> 67 
<2~11> 2503 
<212> DNA' 
<213> Homo 



gtgcggtcct 

ggggtccgcc 

tgtgcgctgc 
cagattttaa 
acatcgagcg 
gcctcttctg 
ctccactttt 
cortacgaccc 
ggtgtaagct 
cttt'agtgag 
ctgagacgga 
gagatataga 
agccgagtta' 



sapiens 



ctagggaggc 
ggccat^ggcc 
gctcatcttc 
gagccccata 
catctgcttc 
ca'ttatgttc 
ctatcacttc 
aqcggtggtc 
ggccttctat 
ctcttiaaogc 
gaggtgcatt 
cccggcaggc 
accct^cgtg 



atcgggctcc 

ttcactttcg 

ttcgccatct 

gaccagtgca 

cttctgcgaa" 

ctgtgtgcgc 

t;ggaggtatt 

atga^tgccg 

etcctctcct; 

aaagaccatg ' 

tctgctggtg. 

agtcagactg 

tctgtgtcac 



taggggcttc 

ctgcgttctg 

g^gcacataat 

atcctgttca 

agctggtgct 

aagagtggct. 

tccactgtcc 

acactttgag 

tcttctacta 

cacatca'toa. 

actggaggag 

aatgggagct 

cctgtt 
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ttggcgtgtg 60 
ctacatgctg 120 
tgcctttgat 180 ' 
tgcgagggaa 240; 
gccagaatac 300 
cacgctgggg 360 
agcagatagc 42 0 
ttactgtcag 480* 
cctttactgc 5.40 
gagactgaga 600 
ggaccagaat 660 
ggaatcacgc 720 
-766 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3500375CB1 



<400> 67 
tgtcctcggc 
agtgctgctg 
caagagaaga 
gggggggaag 
gattgtggtg 
gaccaactat 
ggatticcatt 
tgcagctact 
tctcaatggt 
tcttcagcga 
accacgaatt 
ggagggtcac 
gtcagtggaa 
caagactgat 
ggtagatgtg 
cttacaaagc 
agccagagaa 
tcccactttg 
attcaaagga 
tatgacactg 
ccagccagta 
tcgaacaccc 
aaaagacctt 
tcaacgagaa 
tgcaattagt 
agactgggac 
gccatggctt 
tctgaagtat 
agaactcagc 
attggacagg 
ccatttctgg 
ggtcacagat 
cctgtcatcc 
agtctgtaat 
ttacttaaaa 
gattataagg 
aaaatactat 
agggatgtga 



ggcctgggtg 
ctgttggttc 
aggaggcagg 
atggcggacg 
aagggagacg 
gttgtttggg 
ttatttctac 
gaaaatattc 
gaagcgtcaa 
tctactcaag 
gaggatgaag 
aaagaaggga 
aaaattgctg 
ctagatgatg 
acccgagata 
acaggaaaga 
gaagggcgtg 
cgcaccaaac 
aaagaagaaa 
aaatctgtaa 
ccaagaccag 
attatcataa 
ctacaggacc 
aatgaaactc 
gttacagtac 
cgcgttgtag 
ttgcctgatg 
gatgaagttc 
tatcacaaac 
tacatggtaa 
aaattgagac 
tgatctttta 
atataagcaa 
ggaaattgta 
tttgtttctt 
ctctctgagc 
gttatgaatg 
ctattttgaa 



gctactgccc 
gtcgcggcgg 
cgcggcggca 
tgcttagcgt 
aagtgatctt 
ggactggaaa 
ttaataacgt 
ctgtggttag 
catcggcaag 
tcaaacgagc 
agtgtgtgcg 
ttgtacagac 
caatcaaagc 
acataactgc 
ttgtcagcag 
atttttccaa 
cacctgaaca 
agcctatccc 
cggaaggctt 
cggagggtgc 
tttctcaagc 
ttcctgcagc 
tgaaatttgt 
taatacaaag 
cttatagagt 
ccgtttttgt 
gatcaccagt 
gtctggatcc 
gtcatttgga 
agcataaatc 
tcaagcttta 
taagacctta 
actttttggc 
tattttgata 
tagaaaataa 
catcttctga 
gtattaaatt 
tgaatcagaa 



ctgctgctgt 
cgaaggagga 
gcggcggcgc 
cctgcgacag 
cggggagttc 
ggaaggccaa 
gcacctttct 
aagacctgat 
tatagacaga 
tgcagatgaa 
ccttgataaa 
tgaacagatt 
caaaattatg 
ccttaaacag 
agagagagta 
gaacattttt 
gcgacctgcc 
agctgcctat 
caaaattgac 
atctgcccgg 
aagacctccc 
taccacctct 
cccatcagat 
aagaaaagac 
agtagaccag 
gcagggtcct 
tgatatattt 
aaatgttcag 
tagaccagtg 
gcacttgaga 
tgaatttatc 
tttgacgctt 
ttacaactat 
.gaagtttttt 
atgcaggtta 
tttttcattg 
ttagtctctg 
tgtcaacttg 



cgtaggcgag 
ggaggaagag 
cccgagccgg 
tacaacatcc 
tcctggccca 
cccagagagt 
catcctgttt 
cgaaaagatc 
agcgctccct 
gttctagcag 
gagagattgg 
aggtctttgt 
gctaagaaaa 
aggagttttg 
tggaggacac 
gcaattcttc 
ccaaatgcag 
aacagatacg 
actatgggaa 
aagactcaga 
ccaaatcaga 
ttaataacca 
gaaaagaaga 
cagatgcaac 
ccccttaaac 
gcatggcagt 
gctaaaatta 
aaatgggatg 
ttcttacggt 
ttctgaatta 
aagaacttaa 
tgtgct tcaa 
ttttttaata 
ctccattggt 
taaatgtgtg 
ctctataatt 
gaacatccaa 
tatgtacact 



gacggctgtt 
ggcgaggcga 
cggaggcgag 
agaagaagga 
agaatgtgaa 
actacacatt 
atgtccgacg 
tacttggata 
tagaaatagg 
aagcaaagaa 
ctgcccgttt 
ctgaagctat 
gatctactat 
tggatgctga 
gaacaactat 
aatctgtaaa 
cacctgtgga 
atcaggaaag 
cctaccatgg 
ctcctgcagc 
agaaaggatc 
tgcttaatgc 
aacaaggttg 
cagggggcac 
ttatgcctca 
tcaaaggttg 
aagccttcca 
taacagtatt 
tttgggaaac 
tttggctcct 
aaatgaagaa 
ggagatgata 
ttagccttct 
taaattagca 
tatatttaga 
ctttttactg 
aaccaagcaa 
atatctacac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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ttactcatta tttaaaaaga ataatgaaaa atctagatca attcttcaat ttgattgaac 2340 

tgttcagcct tttcaagatt tctttattta caaatgatta catttaaatg aatgtacatt 2400 

cttctcactg actttggtga ttttgaaacc tagaatgatg tgtttctatc tgtaatatct 2460 

ttccatttga aaaaaatctc aaaacacaga ttaaaaccac aaa * 2503 

<210> 6 8 * - *' - * ' ' 

<211> 541 . . : ' , • - 5 

<212> DNA - 
<213> Homo sapiens 

<220>. ; *' ^ , ~' *-■' 

<221> misc_feature 

<223> Incyte ID No: 5080410CB1 



<400> 68 

atggcgtcca 

gargagctgt 

atccgtctct 

ctgcacctcc 

atggagatca 

atcttgaagt 

aacgttcagg 

atgctgccac 

cctcactccc 

9 



tgcggga^ag 
gggcgccgcc 
gcttccatgg 
cgcgccgcac 
tccagcticgc 
cctttccgga 
atattttggg 
tggagtgcca 
ccgggtgaag 



cgacacgggc 
cagcatcgcg 
cctctcgtcg 
ggtggacgag 
cagcctcgac 
cacaggctcg 
agaacttaga 
gtacttgaac 
cattttcagt 



ctgtggctgc 
tccctgctca 
geagtgaagc 
catcctattt 
tcggacccct 
cttaacctgg 
gaaaaggtgg 
aaaaacgccg 
taaagcggaa 



acaa;caagct 
cggccgcggt 
tcaagttgct 
tgccaatgaa 

gggtgctcat 

agctggagga 
gtgagtgtga 
ctgacgaccc 
acccaagagc 



gggggccacg 

catcgacaac 
actcgggacg 
gggcgcccta 
ggtcgccgac 
gcagaatccc 
agcgtctgcc 
tcgcgggacc 
gccacgctgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

541 



<210> 69 

<211> 937 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5218248CB1 



<400> 69 

gactacgacc aaaacaaagg agcggcggcc 
tcggcgcgct tctcagccgg gccgccgacc 
gagaaacacc tgttcaacct gaagttcgcg 
tgcgataagg aggaaaaggc cgaaaaggcc 
atggaagttg cgaggataca cgccgaaaat 
ttcttgagaa tgagtgcgcg agtcgatgca 
atgggcaagg tgaccaagtc gatggctggt 
accatgaatc tggagaagat ttctgctttg 
ctggacgtcc agacgcagca aatggaagac 
ccccagaacc aagtggatat gctgctccag 
aacatggagc tgccgcaggg ccagaccggc 
caggatgaac tgtctcagag actggcccgc 
ctctgaggtt tcctggccat agccaccctt 
tataccctag aaactctgaa cacgccagaa 
cagccccctc cacataaatt aagaaattca 
gttctgtata gctcgtaatg atggtatttt 



gggagcggac ttaccttacc ttctctgcct 60 
caaaggagcc gtccgactat gtctaacatg 120 
gccaaagaac tgagtaggag tgccaaaaaa 180 
aaaattaaaa aggccattca gaagggcaac 240 
gccatccgcc agaagaacca ggcggtgaat 3 00 
gtggctgcca gggtccagac ggcggtgacg 3 60 
gtggttaagt cgatggatgc gacattgaag 420 
atggacaaat tcgagcacca gtttgagact 480 
acgatgagca gcacgacgac gctcaccact 540 
gaaatggcag atgaggcggg cctcgacctc 600 
tccgtgggca cgagcgtggc ttcggcggag 660 
cttcgggatc aagtgtgacg gcagaacccg 72 0 
tgaaatgctc tctgtgtgtt agagagatac 780 
tgctgaaatg cccttctacc tttgggttta 840 
gtatttctgc actcttagct gtattctaaa 900 
tatagca 937 



<210> 70 

<211> 823 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 058336CB1 

<400> 70 
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cccatcacgg 
gccgggcgaa 
tcccggtgcg 
tttgaactae 
tcagcactgg 
cattcctgcg 
gcagctcttg 
aa<?gtacccc 
^ggaggtggcc 
tggagaagga 
tgctgttggt 
ttaacaattt 

gggg&gaata 

actctttaat 



cgtagtgcaa 

tggtcggcag 

agaagaagac 

cctggdaacc 

gccctgtact 

tcctttacaa 

aaggatgttg 

cctctccctg 

tataacttac' 

gtttcagagc 

gtcttctcat 

accaaagaga 

cttattttca 

aaattgctta 



gctaaaatta 

ctgcgaggcc 

cccggcttga 

cctgcccagg 

tgggtgatgg 

gcgccaagtc" 

tgggaaat^a 

tggaagaaat 

ttgtcaacaa 

aggccaaccg 

tccttgggctt 

tattgatatu 

gtgcatcatt 

ctgatattat 



accctcacta 
aagagagacc 
gagtgagatg 
ggacttgafcc 
ttacgttatc 
tgtatitcagc 
cacatacaga 
cataaagcgg 
ctgtgaacat 
agcgataagt 
gtttccaaaa 
gaaggaattt 
actgttccag 
cttaaaaaaa 



aagggaataa 
ccaggacaca 
gcgtttaafcg 
gaagtgttcc 
'aacatagcac 
agtaaggccc 
ataaacaata 
tcagagt.ttg 
tttgtgacat 
accgttgagt 
ggacaa'agag 
gggaggagga 
attcctat^a. 
aaa 



gcttgcggcc 
cacagctgcc 
attgcttcag 
gtcctggcta 
ctgtagatgg 
tggtgaaaat 
katacgatga 
taattggaca 4 
tgcttcgcta 
ttgtgacagc 
caaaatacta 
aaagaaacct 
tggatggcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

823 t 



<210> 71 ■ . 
<211> 1033 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1511488CB1 



<400> 71 

gcgccagggg 

cgaggagagc 

acaagcgggc 

agtccgctgc 

gccccggggt 

tcacggggag 

cctgctgtgg 

aatgatagac 

cacaattatc 

atccttaaca 

tgcacaggct 

tgtcatcagc 

ttttattttg 

actgaagtat 

atgtcgtgaa 

cacagcctca 

tttcatcacc 

taaaaaaaaa 



ttccagctgc 
tgcggggaca 
agcatgctca 
ggccccgggg 
ctcccctgct 
attcaccgag 
acagggcgtt 
accgcaagaa 
gcctgctttg 
agttggaact 
aaagctaaat 
aacagtagaa 
gtaacaagaa 
accatgcatt 
cacctgtgta 
ttcctgcctt 
tttgattcat 
aaa 



acgtcccagg 
acgcctgtgg 
gggcggtcgg 
cgcctctcga 
actccagcgg 
tccccacgca 
tcaaatcgat 
acaaagctcg 
ctgtgatagt 
tggcaaagaa 
gatattctaa 
gatgggaaaa 
gcacaatgtc 
caaaaagtgg 
accactgcca 
ttctcagcca 
tttgcctgtt 



ctctccagcg 
ctgggtccgg 
gagcctactg 
ggccacgcga 
cggggccccc 
gcgcaggcct 
ggaggagatc 
agtgaaagct 
gtcagccaaa 
agctaagtgg 
gtgacaaagt 
atagaatatt 
ttttttattt 
acaaaactgt 
aagtgaagat 
ttacctccca 
tttgaacttt 



cgcggcaggc 
aggtgcgggt 
cgccttggcc 
cggcccgcac 
agcaattctg 
tcgcagttcg 
ccgcctcgga 
tgttacataa 
agggctgtag 
cgtgaagaag 
gttcacctga 
taccaaaata 
ttatttttta 
atacagtctg 
gtagaatatt 
aacatagcag 
atataaatgg 



cggggcggga 
gcggcgcggg 
gcgggctaac 
cggctcttcc 
ggccccaagg 
acaagaaaat 
tcccgccaga 
tgattggact 
aacgacatga 
ctgcattggc 
ataccatccc 
tctgccatgg 
gtaaactttt 
atagatattt 
ggcaacactt 
tttttctgag 
atttatacat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1033 



<210> 72 

<211> 1622 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



1638819CB1 



<400> 72 

ggcacttccg 

gtgtgtgttt 

acaggcgctg 

aaggtggaga 

cagattctgc 

caggtggcgg 

gtgcttgggg 

gctgacgtga 

gcagatgtga 



gcggcgcgct 
cctttcagcg 
agcacctgtg 
gttaccggtg 
ggcccggcct 
tgcagaaggt 
tagatcttct 
ctgacccgag 
ttctgagcga 



gcaggcgcgg 
tcaagggttc 
gctgacccga 
tcgaagcgcc 
tcgggtgtta 
caacgccgca 
tcacatattc 
aacctcacag 
catggcgccc 



ggaacaccaa 
cacactgttg 
catctcaggg 
ttcaagctcc 
gactgtgggg 
ggcacagatc 
cccctggaag 
agaatcctcg 
aatgccacag 



tggcggggta 
ggagtcgctg 
acccatttgt 
tggaggtgaa 
cagctcctgg 
ccagctctcc 
gagcaacttt 
aggtgcttcc 

ggttccggga 



cttgaagctg 60 
caagaatcgg 12 0 
gaaggctgcg 180 
cgagaggcac 24 0 
ggcctggagt 300 
tgttggcttc 360 
tctgtgccct 420 
tggcaggaga 4 80 
cctcgatcat 540 
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gacaggctca tcagcctgtg cctgaccctt ctcagcgtga ccccagacat cctgcaacct 600 - 
ggggggacat tcctttgtaa aacctgggct ggaagtcaaa gccgtcggtt acagaggaga 660 
ctgacagagg aattccagaa tgtaaggacc atcaaacctg aagccagcag gaaagagtca 720 
tcagaagtgt acttcttggc cacacagtac cacggaagga agggcactgt gaagcagtga 780 
ggatttcttg tgccattttc ataatggtca ttagctcctt ttaagctaga aacgtiiagcct 840 
gagctcctga agagttcctg ggagatttga gctgattttg gagatggagc aggacaagtg 900 
gggagtctct ctctctcttt ctctctctct ctttttaacc aaaaagagat gacaaaacta 960 
agttcagggg ccatggaaaa tgaaaaagtc cgctatattg tgatttggga agagaaagtt 1020 
atcaagagaa agaggtgagg atggaaggat ggagaaaaac agactgtggg aaggatcaga 1080 
aggaatccgc cgaggcaggg atgggtgtgc ccatgtgtgc cttgacggga cttcatctta 1140 
tagactgtta aactgtcaca cacaaacagg ctttccaccc ctgctctgag agcaccacgc 1200 
acagatttcc agttcttagt gtggctgttt aaagtagaaa atctgggggc tgggtgaggc 12 60 
cactcatgcc tgtaaaccca gggctttaga aggctgaggc tgggggattg cttgaagtca 132 0 
ggagttcaag accaacctgg gcaacatagc aacacccccc atgtctacaa aaatgaaaaa 1380 
ccaaaaagca aaccaaaaga aaaatctgaa atttccatct ggggattaac ttctgtcttt 1440 
ctggtgaaca atatagcaat tcacgcattc ttcaagcagc aaaagttc.cc ggaacaatta 1500 
gggaagacgt atggtctgaa tttatccagg cagtgggtct gctttggtt.t ttgctggaaa 1560 
tttatatcag tgtctgggct cccaagaaca taaatgtaat tgccaaagca aaaaaaaaaa 1620 



<210> 73 
<211> 2449 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1655123CB1 

<400> 73 

cgttgccggg ctctccggaa ggagacgtgg cggcggttgg gccggtgata cccgggcgct 60 
ttatagtccc gccgcctcct cctccaccct cccctcctcc tcctctcctc ctggagcaga 120 
ggaggttgtg gcggtggctg gagaaagcgg cggcggagga tggaggaagg aggcggcggc 180 
gtacggagtc tggtcccggg cgggccggtg ttactggtcc tctgcggcct cctggaggcg 240 
tccggcggcg gccgagccct tcctcaactc agcgatgaca tccctttccg agtcaactgg 300 
cccggcaccg agttctctct gcccacaact ggagttttat ataaagaaga taattatgtc 360 
atcatgacaa ctgcacataa agaaaaatat aaatgcatac ttccccttgt gacaagtggg 420 
gatgaggaag aagaaaagga ttataaaggc cctaatccaa gagagctttt ggagccacta 480 
tttaaacaaa gcagttgttc ctacagaatt gagtettatt ggacttacga agtatgtcat 540 
ggaaaacaca ttcggcagta ccatgaagag aaagaaactg gtcagaaaat aaatattcac 600 
gagtactacc ttgggaatat gttggccaag aaccctctat ttgaaaaaga acgagaagca 660 
gaagaaaagg aaaaatcaaa tgagattccc actaaaaata tcgaaggtca gatgacacca 720 
tactatcctg tgggaatggg aaatggtaca ccttgtagtt tgaaacagaa ccggcccaga 780 
tcaagtactg tgatgtacat atgtcatcct gaatctaagc atgaaattct ttcagtagct 840 
gaagttacaa cttgtgaata tgaagttgtc attttgacac cactcttgtg cagtcatcct 900 
aaatataggt tcagagcatc tcctgtgaat gacatatttt gtcaatcact gccaggatct 960 
ccatttaagc ccctcaccct gaggcagctg gagcagcagg aagaaatact aagggtgcct 1020 
tttaggagaa ataaagagga agatttgcaa tcaactaaag aagagagatt tccagcgatc 1080 
cacaagtcga ttgctattgg ctctcagcca gtgctcactg ttgggacaac ccacatatcc 1140 
aaattgacag atgaccaact cataaaagag tttcttagtg gttcttactg ctttcgtggg 1200 
ggtgtcggct ggtggaaata tgaactctgc tatggcaaac atgtacatca ataccatgag 1260 
gacaaggata gtgggaaaac ctctgtggtt gtcgggacat ggaaccaaga agagcatatt 1320 
gaatgggcta agaagaatac tgctagagct tatcatcttc aagacgatgg tacccagaca 1380 
gtcaggatgg tgtcacattt ttatggaaat ggagatattt gtgatataac tgacaaacca 1440 
agacaggtga ctgtaaaact aaagtgcaaa gaatcagatt cacctcatgc tgttactgna 1500 
tatatgctag agcctcactc ctgtcaatat attcttgggg ttgaatctcc agtgatctgt 1560 
aaaatcttag atacagcaga tgaaaatgga cttctttctc tccccaacta aaggatatta 1620 
aagttagggg aaagaaaaga tcattgaaag tcatgataat ttctgtccca ctgtgtctca 1680 
ttatagagtt ctcagccatt ggacctcttc taaaggatgg tataaaatga ctctcaacca 1740 
ctttgtgaat acatatgtgt atataagagg ttattgataa acttctgagg cagacatttg 1800 
tctcgctttt tttcattttt gttgtgtctt ataaactgac tgtttttctt tgcttggata 1860 
ctgtgattcc aaaataaatc tcatccaagc aagttagagt ccagcctaat caaatgtcat 1920 
aattgttgta cctattgaaa gtttttaaat . aatagattta ttatgtaaat tatagtatat 1980 
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gtaagtagct 
catgtaaaat: 
tgctctctgc 
gtcctcaccc 
acttttatct 
tgtagacagt 
tgtctatccc 
caatatttat 



aatgaagtaa 

atgtaaattc- 

aagtagttaa 

ttgttaatct 

attacttttt- 

tggtgatgc.t 

ttgtacttigc 

tttaaagtga 



agatcatgaa 
tagtacctga 
actagaaact 
caagaaactc 
tcttcttatg 
tctgagactc 
ctactgtaat 
ataaaatgtc 



gaaagaaatt 
aatcctttca 
gggcacatgg 
ttatttataa 
agtatgttta 
agaatggttt 
atggatttca 
f cacaagcaaa 



gataggtgta 
acagattttt 
tagaggctca 
taggttgctt 
ctctcagagt 
actctaacaa 
cttctgaaca 
aaaaaaaaa 



PCT/US00/19948 

aatgagagac 2 040 
atatagcaac 2100 
catgggagtt 2160 
ctctctcaga 2220 
atctatctga 2280 
aacactgtgc 2340 
gtttacagca 2406" *. 

2449" * 



<210> 74 ' . v 
' <211> 1689. 

<212> DNA 
t <213> Homo sapiens'' 

-<220> • : 

<221> misc_ feature 
*,<223> Incyte' ID No : . 2 

<400> 74 

aagtaatctt agggattgtg 
c'ggagacaga ggcggaagct 
aacctgtagg cctgcaggag 
tggactctca gaagaaagac 
agaacatcct ggctcaggag 
cgagccgaca gaaggcaatt 
gggagcacgt agaggccatc 
cccagaggaa acggacacaa 
tggccatgga gaaacgcaga 
tgcagcatct ggcggaggtt 
agcttgacgg ggtgtttcag 
acaagctgca gaggtatcag 
tgttccctga ggctgaggct 
ctcgacccca ggagcagagt 
agttctccaa ggctgttggt 
gaggacagca gcatggagaa 
catcattaca ggaaagactg 
caagctctgg catctgtgga 
cttcatatgc tggcaggaac 
gagctgggac tggtttgaac 
tcctatgatg cagacatggt 
gttatatgtg ggaagataat 
tgctaaggtt cttgcggtga 
tgtacagcct agtgttaaca 
atcactggtt tcagatgttc 
gtcatccact attgtatgtt 
tttctgtttt cggattggag 
atcaactcac gtatttggat 
aaaaaaaaa 

<210> 75 

<211> 2489 

<212> DNA 

< 2 1 3 > Homo s api ens 



<220> 

<221> misc__f eature 

<223> Incyte ID No: 2800717CB1 

<400> 75 



f ■ . . 

i 



553926CB1 



ggaaggcagc 

gcagccctag 

gaggcagaac 

aagctgctct 

gacactgcta 

gcagctaagg 

aaaattggcc 

ctccgggaag 

gcagtccaga 

tctgcagagg 

aaacttggaa 

accttcctcc 

gaggcagaga 

acaggagaca 

ctacaacctg 

agatcctaga 

tgaactcctg 

ttaaaatccc 

aactattaat 

acagggtgtg 

gaatttaatt 

tttattcatg 

accagaatta 

ttcttggtat 

atttattata 

ttgtttcatt 

aagatgtacc 

ctttatttgt 



( 

tgaactcggc 

aggtcctggc 

tgccagccaa 

gcagccagct 

agggtctcga 

aacaatggaa 

tcaccaaggc 

cctttgagca 

accagtggca 

tgagggagcg 

acctgaagca 

agcttctgta 

atcttccaga 

ccatggggag 

ctggagatgt 

aaaggcctct 

agttcagctt 

tggatctctc 

acagatactc 

cagatgggga 

caaggaggag 

gatccaaatg 

taacagtgag 

ctttttgtgc 

ttcttttcaa 

gacctctagt 

ttttttgtca 

tttctcaaat 



gcctggaaag 

tgaggtggca 

gatcctggtt 

tcaggtagcg 

ccccttggct 

agagctgaag 

cctgactcag 

gctccaggcc 

gctacaacag 

taagacaggg 

gcaggcagaa 

taccctgcag 

tgataaaccc 

agaccctggt 

aaatttgcca 

gacttccctc 

gatttctgac 

tcagttgtgt 

agaagccaat 

gggggtactg 

gagaatgttt 
tttgttgagt 
ctcatctgac 
cttatctaaa 
agattcagag 
gataccttga 
actcttactt 
aaatatttaa 



atggaggcag 

ggcatcttgg 

gagtttgtgg 

gatttcctgc 

tctgaagaca 

gccacctaca 

atggaggaag 

aagaaacaaa 

gagaagcatc 

actcagcagg 

caggagcggg 

ggtaagctgt 

cagcagccga 

gtgtccttca 

tgacttcctg 

acctcccaac 

tacatcccag 

atttgttcat 

aacatgacag 

gccttgggcc 

taggcaggtg 

cctttctttg 

tgttttagga 

acatttctcg 

attggctttt 

tctttcccac 

ttatcagatg 

ggctaaaaaa 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1689 



tZtZrZZ*?? acgcggttag gaagtgtgtg catacgtctg aaccctaaat ggttctcagt 60 

5"? ? * tctcctccca ctgggtggag tagggccttt aagagcagct ggaatgcagt 120 

^" tgat ^ agCgtacca ^ ttgttgcctg tctgaacctc tgccagtcct ggagactggt 180 

gccctgagct ccaaccagcg ggcctcatcc tacaccctca ccaccgcaac ?tc?cacccg 240 
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agcaagaagc agctcccaga gagaaagaac 
gctgcacagg ccgaaaagtt ccagcaccct 
agccccagcc cgatgccttc ctccacacca 

: gaggccatcc gggacaactc acaggtgaac 
ctggtgaaca tgctagacgc tgtgcaggag 
agtttggagg. gctccgtgaa gggcatccag 
gcctccacca gcaacacggt gagcaagctg 
acgcgcgcgg tcaaagagcg catggatagg 

. aaccacgccc agctcctccg acgcaaccat 
. gagatccctg ccagcgtgtt tgtgaaacag 
gagcttccgg atgaaaacaa atccctgcfag 
gatgatgatt tgccccacga tgaggaggcc; 
gaaagfcaggg cagagaaaat aaaaagatcc 
gcattttctc gccagaacat cgagaaaaag" 
gtagagagga gagagaagat taagaaatct 
ggaaaaagct cccccttcaa ggtttctccc 
ggagaaagcc atgcagaaaa tgagaccaag 
ccaaatgacc aggaagagga gtcctttgca 
gctctggtgg aaggggaaat tgcagaggag 
aactcgggga tggacagcaa catcgacttg 
gtggccctgg aacaggcaca gaaggtadgc 
gaggcggagc gctccgatgg ggaccccgtg 
tcctgagctt agagccaccg tgccatcctg 
agaacaaact cttgcacatc tccagcacga 
gtaatcattg accatatagt catagtaaga 
acctgtcacg gactccactt ttaatttgct 
agattgaaga agaaactgag cagttgaaaa 
aacagaggat aattaagata ctagtcatga 
aagcttgctg aatagtgtac cggtaatata 
ttattattct ttaagtgttg atatataata 
atatgtttta agaagaaaac atctacgaaa 
cttgctagaa accattgttt tattgcaaac 
gctaatgaag taagatacgt agtaaagtca 
ggaaaatgag agaaacagga aacaaagaat 
tatgtgcttg aaagtaaaat acttacaata 
aatactaaag aagttcgtat atcctggaat 
ctaatatttt taataataaa cattttatag 
aggacattac aaatatttct ttaaaaaaa 



gttcccacct gcctagccat gggagaggac 300 
gggtctgaca tgcggcagga aaagccctcg 3 60 
agccccagcc tgaacctagg gaacacagag 420 
gcagtcacgg tgctcacgct cctggacaag 480 
aaccagcaca agatggag;ca gcgacagatc ; 540 ; 
aatgacctca ccaagctrctc caagtaccag 600 
ctggagaagt cccgcaaggt- cagcgcccac 660f 
cagtgcgcac aggtgaagcg gctggagaac 720 
ttcaaagtgc tcatcttcca ggaggaaaat 780 
cccgtttccg gtgccgtgga agggaaggag 8*40 
gaaaccctgc acaccgtgga' cctctcctca 900. 
ctggaagaca gtgccgagga aaaggtggaa 960* 
agcctgaaga aagcggafcag cctcaagaaa 102 0 
atgaacaagc tggggapaaa gatcgtatct 108*0 
ctcacgtcaa atcaccagaa aatatcctca 1140 
ctcactttcg ggcggaagaa agtccgagag 1200 
tcagaagacc tgcctageag tgagcagatg 1260 
gagggtcatt ccgaagcgtc cctcgccagc 132 0 
.gctgctgaga aggcgacctc cagggggagt 13 80 
actattgtgg aagatgaaga ggaggagtca 1440 
tatgagggta gctacgcgct* aacatccgag 1500 
cagcccgccg tgctccaggt gcaccagacc 1560 
tgctgtgctc aagcgggcag ccagggctga 1620 
ctcacccact cctgcgttcc tgtccaggca 1680 
cacacgagac caggctttac catgaaagcg 1740 
cttaggttct atctctgtag aatgtctcca 1800 
atgctaatct ctttgactta gtcagaaaaa 1860 
aaagtgattcattctttttt gtcattccat 1920 
ttgtatttcc accgtactct gtgaatctaa 1980 
tacataaata tgtaagctaa acatataact 2040 
ggtaaaaaga gatgatcagt tggttgttta 2100 
gaaggaaaaa tgaagagatt ataaaagtca 2160 
ggactattca aaaagtaaga aagaaaattt 2220 
gccgaaaaga atgaaaacag agaaaaaatg 2280 
gtagcttaac tatttcactc tttaaataaa 2340 
aacatgtcat cttcaaaata tttttatttt 2400 
tgttaaagct gtatttt tct taataaataa 2460 

2489 



<210> 76 

<211> 898 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5664154CB1 



<400> 76 

cctctggcga 

cgtcagaacc 

aggacgtgcg 

ggccaagtgt 

aggcccagga 

tgtcaaccag 

cttggcccta 

ccgcgttagg 

acaggcaagg 

ggaaaagctg 

tcaccatgac 

gttaataggg 

cgttttgatg 

ttttgaccca 



tgacaacagc 
gccgtcgctc 
ggccaggtac 
tgcggcgacg 
gtaatggagt 
gaaaatgatg 
cctttgaatg 
cagcccatcc 
atgagagagg 
agggaaaagc 
catcacgatg 
agacccctgc 
ttacctattt 
gtttgtaagt 



cacacgtgat 
gctcccttct 
ggccccgaaa 
cacctcacgt 
ccaaagagga 
aaaaagatga 
ttagtgaata 
tgcagtatag 
agaatatgga 
agttgagtca 
agttttgcct 
ttcctaaact 
cttgtgggtc 
ttctgtcagc 



cggccaacac 
cggcagtggt 
gtaggaagcg 
cgagaatcgg 
acgagcgtta 
aaaggagcaa 
ctgtgtgcct 
atgggacata 
aaggattggg 
tagtctgcgg 
tatgccctga 
tacacatttg 
tcctattacc 
aggagagttt 



tgagtcttac 
acctgctccc 
gagggggagc 
gaggaggaga 
aacaatctca 
gttgctaata 
agaggaaacc 
atgcataggc 
gaggaggtga 
gcagtcagca 
atcctgatgg 
tggtgtacct 
agcttctaaa 
tacc.tattgc 



ctcgttgtgg 
ggtgtccctg 
aggtttgcgg 
ctgcaaggat 
tcgtggaaaa 
aaggggagcc 
gtaggcggtt 
ttggagagcc 
gacagctgat 
ctgatccccc 
tttccctgaa 
ttgtcgtaaa 
tgaatgttgt 
atggaaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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gctcattata tattgtgaag ttaataaaac agttttaaaa agcaaaaaaa aaaaaaaa 



898 



<210> 77 
<211> 1236 ■ 
* <212> .DNA t ' 

<213> Homo sapiens 

<220> , v V 

<221> misc^f eature / 

<223v Incyte ID No:" 017900CBlf 



<400> 77 - 

cctcggtact 

gcagcgatgg 

gcgacgc^tc 

cgactcccgg 

tccagcgacg 

tgggtgatGc 

gccgagatga 

tcgggcctgg 

cgccaccggc 

gctcctgttc 

gctgccgccc 

cccctcccat 

cacactggac 

cagacgacta 

atcccaccga 

gccacagtcc 

cagcgagacc 

tgttcctggc 

taacgcagta 

aacccctgca 

aataaatgtc 



gacctctgca 

acggtcgggt 

gctggaggaa 

agttcatcgt 

ccctgaaggt 

cctacatcaa 

agcgagtctt 

gggcgggtgc 

gccctccctc 

cctcccgcgt 

ctgggcctca 

gtacatttgg 

agaccaccca 

catcattctg 

cagactgctg 

ctggacagtg 

tatttcctcc 

agccaaatcg 

ttctgagttt 

ttctgctctg 

agacattatt 



gagccgggtg 
gcagctgata 
cccgattccc 
gcagacgggc 
gacgttcctc 
gaaggagagc" 
1 tggatgggag 
tctggggagg 
cgcgcctccc 
agtgcttgcc 
ctctggagcg 
acgctgtcct 
ctgccgccgc 
cccacagacc 
ctcctagtga 
atccagacag 
ccacccccag 
agtctctcat 
gcaactgtct 
ctctaataca 
gaaaagaaaa 



i gagcccattg 
aaggccctcc 
tttcccgaga 
tcctacatgt 
atcacccgcc 
'cccctcctca 
gaggacgagg 
gtccgctgtg 
tccccctcga 
tttgttccag 
agccgccgcc 
gcgctccagc 
tgccaagccc 
tgcgctgcca 
tctggactca 
ctggccgccc 
aaacctcttg 
tttctcttgt 
ctctgatgtg 
ctacctggag 
aaaaaa 



facgtccagcg 
tggccttgcc 
4 cgtttgacgg 
tcgtggacga 
tcacagggcc£ 
atgattaccg 
acttctaggc 
ttactggccg 
gccgccgcga 
gaatagcgct 
ctctccttcc 
tgcaagctgg 
tctcctcccc 
cagccatcgc 
cctcggaggt 
cccaagggat 
tgttcttgcc 
ggaccagtta 
tgccttttgt 
aaagtctttt 



aacgacgtga 

gatccggcct 

ogataccgac 

gaacacgttc/ 

cgccctgcag 

gggctttctg 

cgggagaccc 

ccgccagggt 

tgtcccctgc 

ccaggctcct 

agccagccag 

gctcctgtta 

accagactgc 

catccatcgc 

atctgggctg 

ctgtcacctt 

taggcccagg 

gttttgccca 

tcaacacagt 

ccttattttc 



60 r " 

120 

180 

240 

300 

360 

42 0 ' 1 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1236 



<210> 78 

<211> 1634 

<212> DNA 

<2 13> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 035102CB1 



<400> 78 

gtttgactcc cgtgcggtgc ggcccagcag 
tactcccgcc gtcactgccg ctgtccaacc 
cacccctcgg cccgtgtacg cgctctgcac 
agagagcagg gggctcccgg tcccgcaggc 
tgagacccgg gccccgaccg cctcgcagga 
ggaggagggg gctgcggagc ccgccctcac 
atccttggca aggcgcaggg cctaccgccg 
gttcctcctg gtgaaagaca agaagaagag 
cgttattgga gacttgaaga ttctgttccc 
gcggtatgtc tttggttttg agctgaaaca 
gatcaacaaa ctaaaacctc tggaggagga 
ccccagattg ggtctgttaa tgatgatcct 
cagggaggcc caggtctggg agatgctgcg 
tttcctcttt gggtatccga agaggcttat 
cagttacagg cgggtgcctc acaccaatcc 
aagcaacctg gaaatcagca agatggaagt 
ggaaccgcag cactggccag tgcagtaccg 
cagagccaag gccagagctg aagccagtat 
ccacctctgg tgagggttgg tgaaaagttg 



ccacaaagct cccgctgcca ttgctccttg 60 
cctcccccgg ggcttgcgcg gcggctccca 120 
ctgcctgccc gaaaacatgt tgcagacacc 180 
cgagggggag aaggatggcg gccatgatgg 240 
gcgccccaag gaggagcttg gcgccgggag 300 - 
ccggaaaggc gcgagggcct tggcggccaa 360 
gctgaatcgg acggtggcgg agttggtgca 420 
tcccatcaca cgctcggaga tggtgaaata 480 
ggacatcatc gcaagggccg cagagcatct 540 
gtttgaccgc aagcaccaca cttacatcct 600 
ggaggaggag gaggatctgg gaggagatgg 660 
gggccttatc tatatgagag gtaatagcgc 720 
tcggtcgggg gtgcaaccct caaagtatca 780 
tatggaagat tttgtgcagc agcgatatct 840 
accagcatat gaattctctt ggggtccccg 900 
ccr.ggggttc gtggccaaac tgcataagaa 960 
tgaggcccta gcagacgagg ccgacagggc 1020 
gagggccagg gccagtgcta gggccggcat 1080 
gccagtgggt ccccgtgagg acgaactact 1140 
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gtcctgagtc 
ctttaaggat 
gctgtttgta 
ggttttgtaa 
aacattggaa 
gttccaggg/g 
tataaaagaa 
aaaattaaaa 
tggaaaaaaa 



ataagtaata 
ttagattttg 
aatgagattg 
aatggatgga 
ggaaggctgt 
aaaaaaatag 
gaaagactgg 
tgcaatgagt 
aaaa . 



tgggtggggc 
tatcttatgt 
gtctactttt 
agaactttgt 
actttgatgg 
ctggttggtg 
catgtacctt 
tagcagtata 



gagggtctta 
tttgtaacat 
tcctgtagga 
atttatactg 
tttgaaggaa 
tctagccccc 
catttgctta 
ccctggcaca 



tttctgtaga 
ttaataatta 
ttttattgta 
tgattttgaa 
ctcagcagta 
caacactttt 
gctatttgag 
cttaataaat. 



aatcgtgtga 
ctgttaaaat 
gagttttgct 
cagattatgc 
tgatgatctg- 
gtctgttgtg 
tatctagaga 
taaacatttg 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1634 



<210> 79 

<211> 1258 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte J ID No: 259983CB1 



<400> 79 

tcccggccag 

gaggaggagg 

tattgcgcgt 

accaaagatc 

ttgacaacat 

ccagaagaaa 

tgtatttgca 

gacttccaga 

gtcctgcttg 

gtggtagatg 

actctatctg 

cgtcataatt 

tacagtaacc 

ccagtttttt 

atctataaag 

agcaataaga 

tccactcagg 

gaat tttgaa 

gctcttgtta 

cccggggttc 

gctacagacc 



cggtagcaafc 
gagcgcttga 
ggaacggctg 
cctgtctctg 
ctgagtggac 
agatgtttgg 
gagctaagtc 
gctgttttgg 
tgaaaagatg 
caagggctgg 
ggaacaggat 
ctgatgctca 
agtcagatga 
cctttttaga 
gccgttttgg 
aagcagctgc 
agtggtgact 
aaaccacaaa 
cccaggctgg 
aggtgattct 
cggctaattt 



tgcagaactg 
gggggactgg 
cttttggaag 
attcaccaaa 
agctaattga 
ttttcacaag 
ctccagttct 
attgcatgag 
gaagaagttg 
acccagtcta 
aaaaagcaac 
cagtaccacc 
cggctcagat 
tctcacttac 
ggaagtcctc 
tgagaagcca 
gaggttttta 
gctacaaact 
agtgcagtgg 
cctgcctcag 
tttagtttta 



caggagacta 
cctggcgtgc 
gcacaacttc 
cagcttcaac 
cctaagactt 
ccaaagatgt 
cgattcactg 
actcgttcag 
ccagcaggat 
aagactacat 
cagatcagta 
tcaagtgcct 
acagagatgg 
tggaaaagac 
attgacacac 
gaggagcagg 
tgtagaaggg 
gaccctcttt 
cgtgatcttg 
cctcccaagt 
gtagagacgg 



^ctttctaga 
actccgcacc 
ctgaatggac 
cctgaaacca 
cagaccaggc 
accgaagtat 
acagtaaacg 
gagacatctg 
caaaaaaaaa 
tgaaaccaaa 
aactgcagaa 
ccccagctca 
cttctggttc 
agaagatatg 
atctcttcaa 
ggccagagcc 
gaacaaaaaa 
tttttttgag 
gctcactgca 
agctgggttt 
gggttcacca 



caaggcagtt 
tcggggacat 
catgactccc 
ggacgagaag 
ctactattgc 
agagggctgc 

ctatgaaaag 
caatgcctgt 
ctggaatcat 
gaaagtgaaa 
ggaatttaaa 
atctccttgt 
taacagaaca 
ttgtgggatc 
gccttgctgc 
tctgcccatc 
aaaaatatct 
acggagtttt 
acttccgtct 
ataggtgccc 
cgttgggc 



60 s 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

i'2 00 

1258 



<210> 80 

<211> 2223 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc„f eature 

<223> Incyte ID No: 926810CB1 



<400> 80 

aaaagccgcc 

cggccctgcg 

agttctttgc 

agcatctgcc 

gtgcagctga 

taccccggag 

agacgcgctc 

ctcctctccc 

ataccctatc 

tgaatgcacc 

gtcctataat 



gcggctgcct 
aagcagctcc 
cgcctgcaag 
gtccaccagc 
ttgcagagct 
aggagctttc 
tcgtttcaga 
acctgcagat 
gtcgtcagtc 
cttgccgcct 
ggccagcatg 



taggaacggc 
ttcgggcagc 
gcggagacca 
ggagcacagg 
ggacccggat 
tcacggaggg 
gttcgtcccc 
taagcttttc 
atggctagca 
tgggcccatt 
gcagacagaa 



gctgcctcgt 
cccgggtcgc 
gaaggcggaa 
ccatcaaagc 
ctgcgacccc 
cactggttgc 
tgctgagata 
taaaaagcct 
tcattgcacg 
ccatgctgag 
acatgaagtt 



ctctgctacc 
ttagcggcca 
tccacagctg 
cgcatctgaa 
ctgtggacag 
agaggctgga 
ggaaggcaga 
aggcatcttc 
tgtcggtaac 
gtccctgggg 
gttctcgggg 



cctggttggg 
aggaggcttc 
gcgacgcggg 
cttgaattct 
aggttgaccg 
agtgaaataa 
gccacctcct 
ttatattcag 
agccggcggc 
agaagtctcg 
agggtggtgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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cagcccaagg ggaagaaacc. tttgaaaact ggctgaccca agtcaatggc gtcctgccag 720 
attggaatat gtctgaggag gaaaagctca agcgcttgat gaaaaccctt aggggccctg 780 
cccgcgaggt catgcgtgtg cttcaggcga ccaaccctaa cctaagtgtg gcagatttct 84 0 
tgcgagccat gaaattggtg* tttggggagt ctgaaagcag tgtgactgcc catggtaaat 900 ? 
tttttaacac" cctacaagct, caaggggaga aagcctccct ttatgtgatc cgt'ttagagg 960 
. tgcagctcca gaacgctatt caggcaggca -ttatagctga gaaagatgca aaccggact^c 1020 
gcttgcagca gctcctttta ggcggtgagc tgagtaggga cctccgactc agactfcaagg ,1080 
attttctcag gatgtatgca' aatgagcagg agcggcttcc caactttctg gagttaatca 114ci 1 
gaatggtaag ggaggaagag gattgggatg atgcttttat taaacggaag cgtccaaaaa 1200 ' 
ggtctgagtc aatggtggag N agggcagtca gccctgtgge atttcagggc tccccaccga J.2 60 
tagtgatcgg cagtgctgac tgcaatgtga tagagataga ^ga^accctc gacgactccg 1320 v 
atgaggatgt gatcctggtg gagtctcaag accctccact tccattctgg .ggtgcccctc 13?b 
ccctcagaga cagggccaga cctcaggatg aagtgctggt cattgattcc ccccacaatt 1440 
ccagggctca gtttccttcc accagtggtg gttctggcta taagaataac ggtcctgggg 1500. 
agatgcgtag agccaggaag cgaaaacaca caatccgctg ttcgtattgt ggtgaggaag 1560 
gccactcaaa agaaacctgt gacaacgaga gtgacaaggc ccaggttttt gagaatttga 1620 
tcatcactct ccaggagctg accca-tactg agatggagag gtcaagagtg gcccctggcg 1680* 
aatacaatga cttctctgag ccactqtaag ggaccacccc caggtttcag tgaaccctta 1740 ' 
cctatattca gcatccagta gtgggaaaac tggggtgggg gtgggggtgg/gacttctaac 1800 
tgcatgaatt aatccacaaa gcggctatct tttggggtgg agtagaaagg gtcttggata 1860 
ccagcacatt ggagggagat agcctgacct ctgtccttgc tccttctccc tgcagcctac 1920 
gggtctgttt tctgtgtgtg cccatttcct tgacagcttt attctttgtg aaagtggtat 1980 
aatttattgt taaatattcg aacaataaaa aaggtacaaa aagtgaagta caaattaccc 2040 
aaatctctcc acccttatat aatcattgtc aaccctttga tgagtgatat ttccctatac 2100 
ctatgtaccc agatagatat atgcatagat aaaagtgatg aaatataagt gctgttctat 2160 
ctgtattttt tcaccaaaca atatatgttg tgagcttcta tgacaataaa tatatatatc 2220 
act 2223 

<210> 81 
<211> 1370 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1398816CB1 



<400> 81 

cccacgcgtc cggccgggag gactgggtgc 
tgagaggttt tctcgctcta gggagattct 
tgtttccctt ccttcatatg aggaagatca 
ggcaccattc gtacccgttg gaatagcggg 
taaactgaag agcaggggaa atactaaaat 
agcccaaggc tttgttgtag gagcaacgac 
attctgggca aaacctaagc cttagaagaa 
ttgctttagt tagatgtctt attattaaag 
tgtatgggtt tagatggtaa catggcattt 
atttgcttgg tgaccgaatt actagtgact 
aagttaactt aaacatgtca cctaaatgca 
aatttttaag atgaacttag ttctaaagaa 
gtttgctgca gaatgtcaga tattttggat 
attcaacttt tgtctgcctg ttttgtggac 
agtgcatgga atataacttg taaagcttcc 
aaccaaatcc agaaagctta aacaatagag 
actgtaaaca tgagaataac ttaaggattc 
tttttgctgc tgatatatta gaataatttt 
tttaagcact cacgcaaagg taaatgaaca 
ccataagaat tgtaaacatt gaactgaaca 
tatgagcaag ctggtttggc cagacagtat 
tatcacactt gtgaaattct cttgtctaat 
tatatgtatt gttattaaag taagtgaccc 

<210> 82 



gcctgcaggg atcggaagcc ggttggggtg 60 
tcaagcaatc actatgtcaa cagacacagg 12 0 

gggatcaaaa ctcattcgaa aagctaaaga 180 

ttttgcagca attgttgcat atggattata 240 

gtccattcat ctgatccaca tgcgtgtggc 300 

tgttggtatg ggctattcca tgtatcggga 3 60 

gagatgctgt cttggtcttg ttggaggagc 420 

ttacctatta ttgttggaaa taaactaatt 480 

tgaatattgg cttcctttct tgcaggcttg 540 

agtttactaa ctaggtcatt caaggaagtc 600 

cttgatggtg ttgaaatgtc caccttctta 660 

gataacaggc caatcctgaa ggtactccct 720 

gttgcataag agtcctattt gccccagtta 780 

tggctggctc tgttagaact ctgtccaaaa 840 

cacaattgac aatatatatg catgtgttta 900 

ctgcataata gtatttatta aagaatcaca 960 

tagtttagtt ttttgtaatt gcaaattata 1020 

taaatgtcat cttgaaatag aaatatgtat 1080 

cgttttaaat gtgtgtgttg ctaatttttt 1140 

aattacctat aatggatttg gtnaatgact 1200 

acccaaactt ttatataata tacagaaggc 1260 

ctgaatttgc attccatggt gttaacatgg 1320 

atgtcaaaaa aaaaaaaaaa 1370 
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<211> 1541 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ip No^ 1496820CB1 

<400> 82 { : * ■ .. i 

gtgttaagct gacaaaatct gtacagaata ?tttaattctt ccttttattt ct?gtgataca 60 
aagattgtgt ttcttttcat agcaacatga 'accgtgaaga ccggaatgtg 'Ctgcgtatga 120 
aagaacggga aaggcggaat caggaaattc agcagggcga agacgccttc ■ ccacctagct 180 
ctcctctctt tgcagagcca tacaaagtta ctagcaaaga agataagtta tcaagtcgta 240 
ttcagagtat gcttggaaac tacgatgaaa tgaaggattt cataggagac agatctatac 300 
caaagcttgt tgcaatdccc aagcctacag taccaccatc agcagatgaa aaatctaacc |3 60 
caaatttctt tgaacagaga 1 - catggaggct, ctcatcagag tagcaaatgg actccagtag 420 
gacccgcace cagcacttct cagtctcaga aacggtcctc aggcttacag agtggacata 480 
gtagccagcg gaccagcgca ggtagcagta gtggcactaa cagtagtggt cagaggcacg ,540 
accgtgagtc atataacaat agtgggagca gtagccggaa aaaaggccag catggatcag 600 
aacactccaa atcacgttct tccagccctg gaaaacccca ggctgtttct tcattaaact 660 
ctagtcattc caggtctcat gggaatgatc accatagcaa ggaacatcaa cgctccaaat 72 0 
cacctcggga ccctgatgda aactgggatt ctccttcccg %gtacctttt tcaagtgggc 780 
agcactcaac tcaatctttc ccaccctcat tgatgtcaaa gtccaattca atgttacaga 840 
aacccactgc ctatgtgcgg cccatggacg gacaggagtc catggaacca aagctgtcct 900 
ctgagcacta cagcagccaa tcccatggca acagcatgac tgagctgaag cccagcagca 960 
aagcacatct caccaagctg aaaatacctt cccaaccact ggatgcatca gcttctggtg 1020 
atgtgagctg tgtggatgaa atcctaaaag agatgacgca ttcatggcct ccccctctaa 1080 
cggctattca tacaccatgc aaaacagaac cttccaaatt tccttttcca actaaggtaa 1140 
gtaaataaaa tgtatctttc ataatgtaag aaaactctaa atggcttgac taaaatcata 1200 
tggattaaaa attgtcttgc cattcctatt ctagtgggag acagacagta aataagtgaa 1260 
taaatagata aattcagata gtgacaactg ttatgaagat aattagcagg gtaatggaac 1320 
tgagagcatc ttggatcaag aggtattaag aaagctttga aggcaatatg cgagagagat 1380 
ttaaaagaca ttaatacagc cggacacggt ggctcactcc tgtaatccca gcactttgga 1440 
aggctgagcc aagagactct cttgaggcca ggagtttgcg accagcctgg tcagcatggc 1500 
aagaccctgt ctctaccaaa aattggaagg aaaaaaaaaa a 1541 



<210> 83 
<211> 1372 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc__f eature 

<223> Incyte ID No: 1514559CB1 



<400> 83 

cggctcgagc agctgccgaa gtcagttcct 
gaggcaccga ggcactcaga ggaggcgcca 
acccatgcgg cagcaaggcc tgccgccgcc 
gccgcgactg tgatgcgcta atggcgggct 
tcgactttgt caccgagaca ccactggagg 
ttggcctgcc caagctctac cttcccactt 
gtggctcctt cccatcgctg tcacaggcgg 
cagacctgaa ttctttttca tttgagaagt 
gagtgggggc atcatcaaaa actttggagt 
gaccctgaag tgagcacagc ctagggctga 
atacccccct ctgtcttgtg aaggcagggg 
cctgccctca tggcccctct: gacctgcact 
cctctttggc tcccctgtac cttttgagga 
tctgcaattc ccctctgctg ctgtccctgc 
gctccaggtg gctctgaggt gcctgtccca 
ggaagggaca cacaagaaga agggcaccct 
cgccccctcc tctagctgtg ggggtgaggg 



tgtggagccg gagctgggcg cggattcgcc 6 0 
tgtcagaacc ggctggggat gtccgtcaga 120 
tcttcggccc agtggacagc gagcagctga 180 
gcatccagga ggcccgtgag cgatggaact 240 
gtgacttcgc ctgggagcgt gtgcggggcc 300 
ggtccgctgg gtggtaccct ctggaggggt 3 60 
ttatgaaatt cacccccttt cctggacact 420 
aaacagatgg cactttgaag gggcctcacc 480 
cccctcacct cctctaaggt tgggcagggt 540 
gctggggacc tggtaccctc ctggctcttg 600 
gaaggtgggg tcctggagca gaccaccccg 660 
ggggagcccg tctcagtgtt gagccttttc 720 
gccccagcta cccttcttct ccagctgggc 780 
cccttgtcct ttcccttcag taccctctca 840 
cccccacccc cagctcaatg gactggaagg 900 
agttctacct caggcagctc aagcagcgac 960 
tcccatgtgg tggcacaggc ccccttgagt 102 0 
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ggggttatct ctgtgttagg ggtatatgat gggggagtag atctttctag gagggagaca 1080 
ctggcccctc aaatcgtcca gcgaccttcc tcatccaccc, catccctccc ?ag??clttg "JJ 
cactutgatt agcagcggaa. caaggagtca gacattttaa* gatggtggca gtagaggcta 1200 
tggacaggge acgccacgtg ggctcatatg gggctgggag tagttgtctt ?«?g?Hc" 126? 
aacgttgagc ccctggaggfc actgaagtgc ttagtgtact tggagLftg gggStgacc 1320 
ccaaacacct tccagcfccct gtaacatact ggcctggact. gtttEctct? ?g 1372 

<210> 84 ' ' • ■ ' ' " ' ' r 

<211> 868 ; ( ■ ' ' 

<212> DNA - ■ 



<213> Homo sapiens 



r 



<220> ..- ;^ '' • y . ; ~ . '- j ■ . , \ 

<221> misc_feature -. ' t ■ ' '' 

<223> Incyte ID No: 162009L2CB1 , ; 

f . - ■ 

<400>,84 ,• . ' 

accagcgcca ccgtccccgg cgaagttc'tg cgctggtcgg cggagtagca 60' 
allaltaalt g?!*^" gcggtctgcg tctggcagca ggttggttca tglgftcctg 120 
«™ cat ^9tgag attttatggg tgacagggag agagatcagg cttgacttga 180 

gagcacgtgg gaaaagaagg gggctatctc ttcgcaaaga tttaagtatc haLagaac 240 

tgcaattatg agaagctgtt ttaggttatg tgaaagagat gtttccJcat 300 
SSaiao?S ^o^ 9aa9C ^ Ctgatttga agagaataaa tggattttgc acaaaaccac 360 
aggaaagtcc cggagctcca tcccgcactt acaacagagt gcctttacac aaacctacgg 420 
» aaagatcctc atatggtcag gtcgcttcaa aaaggaagat gaaatcccag ill 
atctaatcat ?L1S 9 « tgat ^tg caaagaacaa gatgcgagtg aagatcagc? 540 
°^!^ tgat tgccct 9 ac 9 gtggtaggat gcatcttcat ggttattgag ggcaagaagg 600 
ctgcccaaag acacgagact ttaacaagct tgaacttaga aaagaaagc? cgtctgaalg 660 
aggaagcagc tatgaaggcc aaaacagagt agcagaggta tccgtgtEgg ctggatttt? 720 
tJtcataaaa ^ Lm" 3 taacgtgcc ' gtattaaaaa ggatg?ggtf tglggatccl 7sS 

aaataaatt? SS-J^? CCaaacctgt accatttccg tatttctgct gtagaagtag 840 
aaacaaattt tcttaaataa aaaaaaaa oca 



<210> 85 
<211> 3388 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1678765CB1 

<400> 85 



atatSXS g " agcagtC tgtactcagt ttgtcaagag agcagctttc tgttttgatt 60 
^tSL™ ^ 9aaa aaagacat 9 c tagttaaatt tcaagatgtt ttacttagaa 12 0 

gatgacaagg aagatgaggt ggtttgtaaa ggctcattaa gtaaaactca agatgtttac 180 
catgacaagt cccctcctgg tatcttgtct caaaccatga attatgtggg acagltggct 240 

gggtgfat? 3 atatcaSr 9aaggaaCtC tacaagggca ttaaccajgc caccctctct ISS 
cacattc^ 9 a ^ gtCatcg ^ 3gt a =agcag caggatggca gctatcagtg ttcacctttt 3 60 
Ittlttn^ ttggaaagct gggagtcctg agatccaaag agaaagtgat tgatatagaa 42 0 
atcaacggca gtgcagtgga tcttcacatg aagttgggtg ataacggaga agctttcttt 480 
gttgaggaga ctgaagaaga atatgaaaag cttcctgctt accttgccac cLaccaa 540 
a^^ aa f a ^ cagttctt taaagatatt gacacccctt tggtgaaatc gggtggagat 600 
S "" g39ttC agaca ^ t ca cacgtcttgg aaacagagac SftStLt 660 

tgaaaaagaa aaaacgaagg agaaagaaat acaaacagga cagtaagaag 720 
gatScaaaa aaa.f Ct9C tgCtgcagaa ^acacatgtg atgtaggcgt gagctccga? 780 
99 999 ccca 9gc agcacgagga tcttcaaatg cttccttgaa agaagaagaa 84 0 
^! aa9agC "ttgctctt ccattctggg gatcattacc ccttatc^ga tggagattgg 900 
^"f tttag a9accaccta tccccagaca gcgtgtccta agagtgat?c agagctggfg 960 

Cggagagcct ^tcagatca gagtatcaca tggag£ggac g?gggg?gga 1020 
"?^ agagt cca <=caagg t cagcaaaaga gaacgatctg accaccatcc L?gacagct 1080 
aatoaa a t a « CatCagaaaa ^actcatttt cgggtaattc ccagtgagga calcctcltc 1140 
agtgaagttg agaaggatgc ttccatggaa gacactgtct gtaccatagt gaagcccaaa 12 00 
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cccagagccc tgggtacaca gatgagcgac 
cctcttgaga gtactcagat ttcatctatg " 
ttagcggagg cgccctcaga atccaaaccg 
aaaggtgttc acaaaagaat ccaacaccag 
aagggtctag aacctgaagt tgcagctctt * 
tccaggcagt ggcccgagtc tgacacactc 
agcgcagctg" cagatagcgg caccgagtgc 
gttaccctct ccctttgcgg gggcctcagt 
atggagcata tcattactta tcacgaattt 
aaccttgtaa taaggatata taatcgttac 
cttagcttgc . aagtattcca gaagagcttg 
gacaagatgc caaagaaatc tggtcgctgg 
aaacagctgc cagaatccaa ggagggaaaa 
tccagctcca aggagccggc cggtgccagg 
ggatcacagg agctcgaaga atccatcaca 
cacggcagca* caacttcata taagaagtct 
ctgaagctcc acgatggGcg aaatgatgtt 
acctgtcgct gtgcagggac catttacctg 
gatattgacg ggacaataac caagtcggat 
aaagactgga cccaccaggg tatagcaaag^ 
aagtttctgt actgctcgge tcgtgccatc 
cactgggtca atgacaaggg cacaatcttg 
agcttgttct ccgccttcca cagagaagtg 
gagtgtctaa atgatatcaa gaatctgttt 
tttggaaacc gtccaaatga tgtctatgcc 
atattcaccg tgaaccccaa gggtgaatta 
tcgtatcaca ggctgagtga gctcgtggag 
aattccgctt ttccctgccc ggagttcagc 
gaagtggacc tggatgacct gtcttgaggc 
cccctcccca cagcaaggga aggcagctgg 
cccagcgggg acgggtgctt ctggagctgg 
agctgctcag gctcggcagg tctgcagctc 
cctggggctg gaggcctggg atccctcctt 
gctggggcct ccttccattg catcatttta 
gaccaaaaca ctgcacacag tgaagtgttc 
tgtaaccagg gctgtcttca gatgtcaggg 
tgctggctga agctgcctgt gcacgttt 



ccaacatctg tggcagagct tctcgaacet 12 60 
ttagatgctg accaccttcc caacgcagcc 132 0 
gcagctaaag tagactcgcc gtcaaagaag 13 80 
ggacctgatg atatttacct tgatgactta 1440 
tatttcccta aaagtgaatc ggagcccggt 1500. 
tctggctccc agtccccaca gccc.gtggga 1560 
ctctcaga.tt ctgccatgga cttgcctgac 1620. 
gaaaatggaa aaatttcaaa agaaaaattc 1680 
gcagaaaacc ctggacttat agacaatcct 1740 
tataactggg ctttggcagc tcccatgatc 1800 
cctaaggcca, cagttgagtc ctgggtgaaa 18 60 
tggttttggc gaaagagaga aagcatgacc 1920 
tctgaggcac cgccagccag tgaectgcca 1980 
ccggccgaga atgactcctc gagtgacgag 2040 
gtggacccca tccccacaga, gcccctgagc 2100 < 
ctccgcctct; cctcagacca gatcgcaaaa 2160 
gtgtttagta Ctacaaccca gtatcaaggc 2220 
tggaactgga atgacaagat catcatttct 22 80 
gctttgggac agattctccc acagctgggc 2.340 
ctctaccatt ccatcaatga gaatggctac 2400 
ggcatggccg acatgacccg tggctacqtg 2460 
ccccggggcc ccctgatgct gtcccccagc 2520 
atagaaaaga aaccagagaa gttcaaaa^tt 2 580 
gccccgtcta agcagccctt ctatgctgcc 2640 
tacacacaag ttggagttcc agactgtaga 2700 
atacaagaaa gaaccaaagg aaacaagtca 2760 
catgtgttcc cccttctcag taaggagcag 2820 
tccttctgct actggcgaga cccgatccct 2880 
ggcacctcag tgggtgggca gggcttggtc 2940 
ctcttctgct gacctcagat accagccttc 3000 
tcccgccatc ctcctttgcc ttcccaggcc 3060 
agctcctgga aggagaaggg aggaactggg 3120 
tgtgggtcgc acacatgttt cctgctgtga 3180 
aaggaagaaa aaagcagcta aaaaagagtg 3 240 
cagtttccac tgggcagttg aggtggctcc 3300 
tccctgaact gctgctggcc cagtcagtga 3360 

3388 



<210> 86 
<211> 1707 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc„feature 

<223> Incyte ID No: 1708229CB1 



<400> 86 

cctcgtttca ccagccttgg ccagcccttg cctctggaaa gggggcagct gtttgtctct 60 
gccaggcgtc ccattggcca gggagtgagg ctggagggcc cggcagcagg catttgcgag 120 
tgctggccag ccggcacccc gccccgcttc tcctccccac cttcccgtcg cccactcatg 180 
ctgggagacc actgcagtct ccctgaagac caagcccggc ccggccagtc cctgcaaagt 240 
ggactctgct gcaaaatggt gcttcaggct gtcagcaaag tgctcaggaa gtccaaagcc 300 
aagcccaatg gcaagaagcc cgctgcggag gagaggaagg cctacctgga gcctgagcac 3 60 
accaaggcca ggatcaccga cttccagttc aaggagctgg tggtgctgcc ccgcgagatt 420 
gaccttaacg agtggctggc cagcaacacc acgacgtttt tccaccacat caacctgcag 480 
tatagcacca tctcggagtt ctgcacagga gagacgtgtc agacgatggc cgtgtgcaac 540 
acacagtact actggtatga cgagcggggg aagaaggtca agtgcacggc cccacagtac 600 
gttgacttcg tcatgagctc cgtgcagaag ctggtgacgg atgaggacgt gttccccaca 660 
aaatacggca gagaattccc cagctccttt gagtccctgg tgaggaagat ctgcagacac 72 0 
ctgttccacg tgctggcaca catctactgg gcccacttca aggagacgct ggccctggag 780 
ctgcacggac acttgaacac gctctacgtc cacttcatcc tctttgctcg ggagttcaac 840 
ctgctggacc ccaaagagac cgccatcatg gacgacctca ccgaggtgct atgcagcggg 900 
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gccggcgggg 

cagaaccacg 

agacggtggt 

tggtgaggtc 

acggccccgg 

agcctttggg 

ttctgtgggg 

tggccgcaaa 

tgctctgggg 

caggcctcgc , 

agtgccctgc 

gaacacccat. 

t-gctgcacca 

aaagaattct 

<210> 87 
<211> ,1752 
< 2 1 2 > DN A 
<213> Homo. 



tccacagtgg 

tgaaggagag 

gtgtgttctc 

tgagagggc'c 

ccgtgtcctg 

cccattcact 

ttttcctttc 

ggaggaactc 

gtggggctrtg 

caggagtctt 

acctgctcct 

ggtgcagtcc 4 

gcggaaggct 

ctcagaaaaa 



sapiens, 



gggcagtggg 

atgagccccc 

tcctgcatct 

ccgggctgca 

tggccccctg 

cgttcagcag 

tctggtctccj 

gtggggggag 

ctatcacaga 

ggctgttggc 

gctcctgtca 

tcagqcaagg 

gag&£ccgct * 

aaaaaaa 



gatggggccg 
cgggccggac 
gcgtgtgcac 
ctggtgtggg 
-tcggatggat 
acacgcatgg 
gtgcagcccc 
ggtgtgattt 
gaccctcctt;' 
agctcagagg 
ccccttcetg 
ccaggacggg 
taccttagtt 



gcagcggggg 

aggggcacac 

acatgtgctg 
Ctgcacaggc 
gcgtgcfcgtt 
gactgatgct 
4 tgccctccct 
gcagacctgg 
;ccctctcacc 
tgggggaggc 
t ctgcctcctc 
gctgaggccc 
catctgti'tca 



cccgggagca 
gtgtgcaaag 
ggcacgcgt^g 
acagacgcag 
tgtagagaag 
ttgagtttt^c 
cgggtgctgc 
gtctctjgctc 
cctcctctcc 
ctgtggtgtg 
catgcccaag, 
tgcgtggaga 
ctcgtaataa 



960 

1020 

1080 

1140 

1200 

1*2 60 

1120 

1380 

1440 

150.0 

1*;560 

1620* i 

1680 

1707 



<220> 

<221> misc„feature 

<223> Incyte ID No: 1806454CB1 



<400> 87 

cccgggtgcg 

gctggcgagc 

gggtggatgg 

tggtcatcgc 

ttcgggagaa 

gcggacagtt 

ggaggccctc 

ctgtaaagcc 

cccccagcct 

cagacctgcg 

ccttctggga 

tgcaggcact 

aggcccgtgc 

ctatggcatc 

cggaggtgca 

tgcaggctca 

agcagttcat 

ctcctggcac 

gccccactcc 

ccctcctggg 

tctgtgcccc 

gtgtcctccc 

ccagagtggt 

aagaccatcc 

ggcagtggga 

gtccttgcca 

caggccccaa 

gctgtgggca 

tgtgtgtgcc 

gaaaaaaaaa 



ccgcggcgct 

gcccaggaca 

catccagcgt 

actagcccaa 

ggagcggcag 

tgccagcgat 

ctcagacagc 

acgggctgcg 

ctcacgccct 

gggcctggag 

gcaagagctg 

aagtgcggcc 

cctggaggct 

tgccactgag 

gggcagcctg 

ggctcaggag 

ccagcagacc 

tcaggtcgga 

aagctgactt 

cgctccctct 

acctccccct 

gcagtggccc 

gtcctctggc 

tgcccggacc 

agccctgggt 

ggtcgccagc 

aggccacgac 

tgtgcttgcc 

acttccccta 

aa 



gggggcggca 
ggcatgttgt.- 
gtggtctgtg' 
gcaataggcct 
ttgctgccac 
gtccagtttg 
tgtccacccc 

ctgggctgtg 

gggcctgcgg 
ctcagggtgc 
cgccgggagc 
actgctgagc 
gagctgcagc 
cgcctgcacc 
gctctggtga 
ctggaggagc 

ggggctgcgc 

gtggttctgg 
cccaacccac 
gagtcccatg 
ggggcaccgg 
ccatgacgca 
agcccagccc 
acagaaggag 
ttggcctcag 
accctggaga 
tgcctgttgg 
tgcgggagag 
ccccaacgtg 



ggttgcggcg 
tgggactggc 
gggtctcaga 
agactggccg 
aagagtgtcc 
tcctgaggcg 
cggaacgctg 
agccccgcaa 
cccctgtgac 
agaggaatgc 
aggcccggga 
atgccgcccg 
tggcagcgga 
aggacctggc 
gccgggccct 
tgaaccgaga 
tgccaccgcc 
ggggaggctg 
agggccctct 
ctggtgccca 
gccctcctgt 
gaactcctgg 
caggctctgt 
agttggcggt 
gagctggggg 
agcatggggc 
ggacaggaga 
gtccttcact 
aaaacctcaa 



gcgccggagc 

ggccatggag 

gcagaccacc 

ctttgtgctt 

agtgggcgcc 

cacagggccc 

cctaattcgt 

aacactgacc 

acccacacca 

tgaggagctg 

gcgagaggga 

gctgcaggcc 

ggcccctggg 

tgttcaggag 

ggaggcagca 

gctccgtcag 

cccacggcct 

ggaggtgagg 

gcc tccagcc 

gcctaggccc 

ggctgcagcc 

aggtagcagc 

gacagcctag 

cacagagggc 

tgcagtgggg 

gtagccagct 

tgcatggaca 

gtgtgtacac 

taaactgccc 



gggtctccag 

ctgaaggtgt 

tgccaggaag 

gtgcagcggc 

caggccacct 

agcctagctg 

gccagcctcc 

cccgagccag 

ggctgctgca 

ggccatgagg 

caggcacgcc 

ctggacgctc 

cccccctcac 

cggcagagtg 

gagcgagcct 

tgcaacctgc 

gacaggggcc 

acctggccca 

agagaggagt 

cgagggtatg 

acctcagcct 

agctcctgcc 

tgagggctgc 

tcctctgcca 

gactgcccta 

cggaacttgc 

gtgtgctcaa 

agcaagagca 

gaagcagctt 



60 

120 

180 

240 

3 00 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1752 



<210> 88 

<211> 2461 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
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<223> Incyte ID No: 1806850CB1 
<400> 88 

ctgaaagaga gattggaggc ttttacaaga 
gaggtttttc agcccatgtt aatggaatat 
aagaaagtga ttgcacaaca ctgctctcag 
agcctatgga atcatgctga agagcgacag * 
tcagataaag aagcagaagt gtcagaacac 
gtaatgctgt caa'ttttcag ctatcttaat 
agcatgaaat ggtctcagct gacaaaaacg 
cattgggcca gaggtgactg gtatagtggt 
gatgaatggg tgaaaaatag gaaagatgaa 
gctgacattg atgaatctga agagtctgcg 
atggaaaaac gtttactcca tggcttaatt ' 
gtaaaaacct tagtattagc atacagctct 
ttagagcttt gtcctaacct ggagcatctg 4 
gcatttgaca gttggtcttg gcttggttgc 
ggttgtgaga aaatcacaga tgtggcccta 
acatctcatc aaagtggctt tttgaaaaca- 
aaaaataaag acatt^accat gcagtccacc 
aacaagggca ttggagaatja aatagataat 
gagaatttca cttctcctta tgtgtggatg 
gatactgtgg aatggagaca tagaaatgtt 
aactttagtt gttccacctc tggttgtttt 
gtctgttggc agcagcattg tgcttctcca 
tgtacaggaa cagctttaag aactatgtca 
aaagcagcaa ggactagatt gcctagggga 
tctgatcaag agactggacg tgtacttctg 
acagaccatg gtctcagggt tttgactctg 
aatctctctg gttgtcttac tataactggt 
ccttctctga atgatgaata cttttactac 
accgccagtg gatgccagaa tttgcagtgt 
tgacccttga cttctgatct ttgtctactt 
ctttactcat agcacatttc ttgtgttaac 
cccatttctt acaacttcag aaatcttaat 
gcaaattata cttttggttt agaaagggat 
tacatttttc ttttaaatga aatgctttaa 
tatttcatag ataattttga gttttaaagt 
taacactgta ccaagctttg cagatctttt 
aagacagcac atttttggaa actaatctct 
tctcagaagg ccaattcaaa caaacccaca 
tggcttctga gtgttagctc agtgagctag 
gggatacttt gttctcacca ctgtccctat 
taccacaatc atttgaagaa atgtataaat 
a 

<210> 89 
<211> 965 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1851534CB1 

<400> 89 

ctttcagaaa aacccttgtt gctgctgtaa 
gacacagatt tattctttta catttcttga 
agcggcagag gcccgatgag agaaagggaa 
ttctctttct ctttcatgca agggatcatg 
attgactccg ccaacatccg tcaggaggat 
gatggtggcc agacaccata tgaagctact 
acagaagatc ttcctccact cacaaatggg 



gattttcttc ctcacatgaa agaggaagag 60 
tttacctatg aagagcttaa gtatattaaa 120 
aaggatactg cagaactcct tagaggtctt 180 
aagtttttta aatattccgt ggatgaaaag 240 
tccacaggta taacccatct tcctcctgag 300 
cctcaagagt tatgtcgatg cagtcaagta 3 60 
ggatcgcttt ggaaacatct ttaccctgtt 420 
cccgcaactg aacttgatac tgaaccfga£ 480 
agtcgtgctt ttcatgagtg ggafgaagat 540 
gaggaatcaa ttgctatcag cattgcacaa 600 
cataacgttc taccatatgt tggtacttct 660 
gcagtttcca gcaaaatggt taggcagatt 720 
gatcttaccc agactgacat ttcagattct 7 80* 
tgccagagtc, ttcggcatct tgatctgtjbt 840 
gagaagattt ccagagctct tggaattctg 900 
tctaeaagca aaattacttc aactgcgtgg 960 
aagcagtatg cctgtttgca cgatttaact 1020 
gaacacccctt ggactaagcc tgtttctdict 1080 
ttagatgctg aagatttggc tgatattgaa 1140 
gaaagtctct gtgtaatgga aacagcatcc 1200 
agtaaggaca ttgttggact aaggactagt 1260 
gcctttgcgt attgtggtca ctcattttgt 1320 
tcactcccag aatcttctgc aatgtgtaga 1380 
aaagactcaa tttactttgg gagtgaaaaa 144 0 
tttctcagtt tatctggatg ttatcagat'c 1500 
ggaggagggc tgccttattt ggagcacctt 1560 
gcaggcctgc aggatttggt ttcagcatgt 1620 
tgtgacaaca ttaacggtcc tcatgctgat 1680 
ggttttcgag cctgctgccg ctctggcgaa 1740 
catttagctg agcaggcttt ctttcatgca 1800 
catccctttt tgagcgtgac ttgttttggc 1860 
ttaccagtga attgtaatgt tgtttctctt 1920 
taggtctttt caaaagggtg agaacagtct 1980 
agaatgttgg taatgccatg tcatttaaag 2 040 
ccatggaggt gattggttct ctttacacat 2100 
ccgacacaca tgtctgaaga cttattttca 2160 
tttccgtaat atttccttta tttcaatgat 2220 
tttaaggttc tttaggatta tagaataaat 2280 
gaaagcacca atcgatattt gtttccttta 2340 
gtcatcaaat ttgggagaga ttttttaaaa 2400 
aaaatctact ttgaggactt taaaaaaaaa 2460 

2461 



caaattacaa caaacttaat ggcttaaaac 60 
tgtctgacat gggactaaaa tcaaggtgtc 120 
agttaaggat gctggagcag aacaatggat 180 
ggaaacacaa ttcagcaacc acctcaactc 240 
gcctttgata acaacagtga cattgctgaa 300 
ttgcagcaag gctttcagta cccagctaca 360 
tatccatcat caatcagtgt gtatgaaact 420 
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caaaccaaat 
gttagaaact 
gaaattctgg 
gtgattccaa 
aatggcaggg 
^agtgagcaca 
catgaotcat 
gaacaaaata 
agtactaaaa 
tgtcc, 



accagtcata 

ttagccccac 

aaaaaccttc 

agaagactgg 

aattgtttga- 

cgaaatcaaa 

caagatctga 

tgaccaaatg'' 

gagggaagcc 



taatcagtat 
tgactattat 
ccctccacag 
ctcacctgaa 
gtcttccctt. 
gcatgaaagc 
agkgcgcaag 
agagggttga' 
ccagttcagc 



cctaatgggt 

cattcagaaa 

ccaccacctc 

attaaactaa 

tgtggagacc < 

agaaaagaaa 

tcacacaaaa 

cactgtatca 

caatactatc 



cagccaatgg 
ttccaaacac 
ctccttcggt 
aaataaccaa 
ttttaaatga 
agagga^aaa 
tccccaaatt 
gaaaaaccaa 
ttctgttcca 



. <210> 90 ' 
' <211> 2555 

<212> DN A 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



ctttggtgca 480 
aagaccacat 540 
accacaaact 600 
aactatccag 660 * 
agtacaggca 720- 
aagcaacaag 780 
agaaccagag 840 
gggaagaacc 900 
acaacggtag, 960 
( ' 965 

c \ 



1868749CB1 



<400> 90 

agcacgtccc 

tcaacagtga 

cagggtatcg 

ggtgcaacct 

tcctagtgcc 

aacggctggt 

tgaagtaccc 

gcactcgcat 

atggtgacag 

gttggtcacg 

atcaccccac 

gagcagtcta 

ccctgaatct 

tgatgatctg 

cctcttccca 

cgccaaaagg 

cgaaagccgc 

gcatgaagct 

aacgagtggt 

agtgggtgga 

accagctcca 

caaagaagaa 

gacccctcag 

agatctcgtc 

tagacgtggc 

tcaagcagga 

agcccagcgt 

tcctctctgt 

gtctcctggg 

cgcctgaggc 

gaacgctagc 

t tgccaggga 

cctgaacgag 

gacagtgagt 

ctcagcctca 

c ttgtctcgg 

ccttgagctg 

ggggaggaca 

gaagccccgc 

ctggagcctc 

aagaaaccat 

ggaggacaca 



actctatgac 

tgctgtgctc 

ggtgctgctt 

gggaacagtg 

cccacggacc 

gggctccagg 

ctttaggcag 

ggctgtggtg 

tgacgacgac 

acgtgtgggc 

cttccggaag 

cacagaaggc 

gggcaacatc 

tgtggacggg 

cgccatcttc 

ttatgaggca 

tccatcgaga 

ggaggccgtg 

gcatcggctc 

ctgcgagtcc 

gcctcctgtg 

aaagaaacag 

acaggggtcc 

ggagcctgtt 

ctcgcccgac 

aacagacgac 

ttctctacca 

gtaaattctg 

acccgcctgt 

cccagaactc 

tgcctgctct 

tggggccacc 

tcatgtaaat 

tgtgtggtgg 

tgtgttggtc 

cagctaagaa 

ctttctgtgt 

ttgggaggaa 

taaaaataat 

tagagagctg 

gcctggaggg 

tctagctgcc 



cagtgggagg 

cccagccggg 

cggtatgaag 

gatgtccacc 

atccatgcca 

acgcttcccg 

ggcatgcggc 

gacacagtaa 

ttctggtgcc 

cacggcatca 

atctactgtg 

ggttggtttg 

tgcgtggcaa 

gggccctcca 

ccggccacct 

cagactttca 

ctctttaaca 

gacctgatgg 

ctcagcatcc 

ccagacatct 

gccgcagaac 

tttgggaaga 

aagaagcccc 

cctggcgaga 

aaggcttcaa 

tgagccttcc 

ccaccaccat 

cccggtgctg 

tgcttctgcc 

gtctgtgaac 

tccttaagat 

actgtcacac 

taagttctag 

gggcagcctc 

ctctgctcct 

gcagtgacca 

ttgtgaggac 

gatggcctga 

tcatccaaga 

ggcttgtatg 

gccgtgaaca 

attgcaacct 



atgtgatgaa 

tgtactggat 

gctttgaaaa 

ccattggctg 

agttcaccga 

tggatttcca 

tggaagtggt 

tcgggggtcg 

acatgtggag 

agatgtcaga 

atgccgttcc 

aggaagggat 

ctgtctgtaa 

cagatggctt 

tctgtcagaa 

actgggagaa 

tggattgccc 

agccccggct 

actttgacgg 

accccgtcgg 

cggccacacc 

aaaggaaaag 

tgctggagga 

tcattgctgt 

gtccagagct 

tgcctccagc 

gcctccacct 

tgaaggctgg 

ctcccctgtg 

caccttttcc 

ggcctccccc 

tgtggaatac 

agcagctctc 

tgcctcaaaa 

cctagctccc 

ggatgtggat 

tgactcccat 

gtgtgcactt 

ttcctttgta 

ttcttttggc 

cagaaccctc 

cactgggctc 



agggatgaag 

cgcctctgtc 

tgacgccagc 

gtgtgccatc 

ctggaagggc 

catcaagatg 

ggacaagtcc 

cctacggctc 

ccccctgatc 

gaggcgaagt 

ttacctcttc 

gaagctggag 

ggttctcctg 

ggactggttc 

gaatgacatt 

ctacttggag 

aaaccatggc 

catctgtgtg 

ctgggacagc 

ctggtgtgag 

gctgaaggcc 

aatcccgccc 

cgaccctcag 

gcgtgtgaag 

gcctgtctcc 

ctggcttcta 

gactttggct 

acggtggagg 

gaaaggtcta 

agccagagtt 

cgacccgcca 

aagacagtga 

tgagcaggat 

attcaccaag 

cagggatgtt 

tttggcgacc 

t tcctaaagg 

tggctctgct 

gttaaagggt 

cttttgttcc 

aagacaagga 

cccagactct 



i 

gtggaggtgc 

atccagacag 

catgacttct 

aacagcaaga 

tacctcatga 

gtggagagca 

caggtgtcac 

ctctacgagg 

cacccagtgg 

gacatggccc 

aagaaggtac 

gccattgacc 

gatggatacc 

tgctaccatg 

gagctcacac 

aagaccaagt 

ttcaaggtgg 

gccacggtga 

gagtacgacc 

ctcaccggct 

aaagaggcca 

actaagacgc 

ggtgccagga 

gaagagcatc 

gtcgagaaca 

gctggaagcc 

tggagactga 

acctgctggg 

tatgacgggc 

cccaaagctg 

cggccctcag 

actctgtctg 

aaggtcccct 

cagaatgcct 

ggggacccag 

tgtgtggtgg 

aaatgccccc 

acctgctcct 

ccagttctga 

tacctaaatg 

tgacagagct 

gtgtgtgaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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aattaaaccc cctgcttgct tgagaaaaaa aaaaa 



2555 



<210> 91 

<211> 4172 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



1980010CB1 



t<400> .91 
ccacaaagct 
tggaaaaaca 
aagaggcacg 
tcagtgagtc 
aoacatgt.ct 
gtcgccttga 
atcctccagc 
gtattgactt 
atagctgcat 
ttcaagagga 
tcctgccttt 
cccagtcccc 
gggttgcagg 
aggcacttcg 
caggttatcc 
acttggccac 
ttgaacttct 
atttccagat 
cagtacaaga 
ccactgctac 
aggccgtcag 
ggcaaaaatg 
aagggtgtca 
ggacctatga 
ctggaaaatt 
tgcaactagc 
ccttaccaca 
ctctccagct 
tcagggacaa 
ccaggcatgt 
' agcagttaca 
gccttcggaa 
ctatccttgg 
cacaacgtta 
cggacagtgg 
agactttgaa 
ctattggtgg 
gctgtgcaga 
agtttaaggt 
ctcttgaagc 
ttcccaaaat 
ggttacagaa 
caccggagtg 
gtgacctcat 
ctgtggaagc 
agaagccaag 
cctatgcaga 
acagcttcat 
tctatgatct 
tagatggtca 
acatctcacc 



tcatgacatg . 

tggactcgag 

caagctgatg 

tcattggaga 

agattctgat 

aaacatccac 

tggtatagcc 

ggttttggct 

ggatctagcc 

gctagatctt 

gcaagtgcga 

cacatgctat 

tgagaaccca 

cttccatgac 

taaaagttgg 

tcgtcaagag 

tttggcagct 

ccatcatgaa 

ggatgaagta 

caccatgaaa 

tgatgggcag 

tggtggtgca 

tcctttttat 

cacctatcag 

ggcagaggct 

aagtgaagcc 

agtgttagat 

ggcagcgtat 

gtgccatcct 

gactcgacat 

ctgctacaat 

gggtgtggac 

tctggcagaa 

cagtgtctcc 

tttgtccaca 

gactgatcca 

ctttgatcac 

tttggggaac 

tgttgcatca 

attggagcca 

ccctgaaaag 

gttgttctgg 

gcttcatgcc 

cactgtggta 

ccgtaaagag 

gaaaagaaac 

tactttgaat 

cctttctctg 

gtcccgatca 

gcctctagca 

caaggatata 



gtagaccaac 

aaaccaattt 

gttagattga 

acgttgctgc 

gcctgctatg 

ctggctggac 

cataaaggga 

gccagcagag 

aggtgctgct 

atccaagccg 

ttgtgccctg 

aaacaatcca 

gaagaaaggc 

tacaaagcag 

gatgtttgta. 

ctcatggctt 

agcagctctc 

ggaggggaaa 

ggtgttccag 

gtcctttcca 

tggtggaaga 

tatcaaatcg 

gaatctgtca 

catgttccag 

aaaaataaag 

ttgccaaatg 

gctaacc.ggt 

tactatagcc 

ctttacaggg 

gagcacgaag 

gaacgtctcc 

gtgcagcggt 

actctagagg 

cgctgggaag 

ctagaaattg 

gaagcctttc 

gaaaggctgc 

tgtgccatta 

ggtcttaatt 

gttctttcaa 

gatggacaga 

actggagacc 

tatgatgtct 

gatgcagtta 

atgactagaa 

tcagaagacg 

catctggaga 

aagaatagtg 

gaaaaagaga 

atgattcagc 

gtgcagagtg 



tggaacaaat 

catttgttaa 

cgaggcacac 

aagacatgtt 

agatatntac 

agatgatgca / 

acccccacta 

agtacttcaa 

tacaactgat 

ttggatgtct 

atcggatcag 

ccaagcttct 

ggggacaggt 

ccagtatgca 

gccagttagg 

ttgctttgac 

tgcagacaga 

atatcagtgc 

gtagcaattc 

acaccacaac 

agtctttaac 

gaactacagc 

tctcaaatcc 

tggaaagctt 

gagaagtatt 

acatgacctc 

gctttgaaaa 

tccagatcta 

ctgatcccaa 

cctggcctga 

tggatttcac 

ttacngcaga 

aaagcgtcta 

tttttatgac 

aaaatagagc 

accagcacat 

agtattattt 

aaccagaaac 

acaaaaagct 

gtcaaaatat 

tgctttcccc 

ctcatctcat 

gcatgaagta 

cattttctcc 

aggctattaa 

aagctcaaga 

aatcacttgc 

agcaggaaac 

aacttcatga 

agctgctaga 

caatcatgaa 



tctcagtgtg 

aaacactcaa 

tggccggaag 

aactatgcag 

agaaagcctt 

ctgcagtgpt 

cagggtca'gc 

ttcttctacc 

aacagacaga 

tgaagaattt 

tctcatcaag 

gggccttgct 

tctaatcctt 

ttgtcaggag 

acaatcagaa 

acattgccct 

aattctttat 

ttcaccatta 

agctgaccta 

caccaccaaa 

ttaccttcga 

caatgaagat 

ttttgtcgct 

tgcagaagta 

tccaacaact 

ggctcttgct 

gcagtccccc 

tgcccgattg 

agaactaatc 

agaccttatt 

tcaggcgcag 

tgaccagtat 

cagcattgct 

ccatttggag 

ccaagacctt 

ggtcaagtat 

cactcttctg 

ccacattcga 

gacagatgaa 

cttgtctatt 

aagctctctg 

taaacaagtc 

ctttgatcgt 

aaaagctgtg 

gacagtcaaa 

agctaaggat 

ccacctggaa 

actgcaaaaa 

tgaagctgtg 

ggtggcagtt 

aataatttct 



tcagagcttt 

tctagctoag 

cagcctcctg 

cagaatgtat 

ctgtgctcfca 

tgttcagaaa 

tacgaaaaga 

aacctcactg 

ccccctgcca 

ggggtaaaga 

gagtgtattt 

gagctgctga 

ttagtggagc 

ctgatggcca 

ggttaccagg 

cctagcagca 

caaagagtga 

actagtaaag 

ttgcgctgga 

gcggtgctgc 

ccccttcagg 

ctagagaaac 

gagtctgaag 

ttgctgagaa 

gaagttctct 

taccttcttg 

tctgcattat 

gccccatgtt 

aagatggtca 

tcactgacca 

atccttcagg 

aaaagggaaa 

atttctctgg 

ttcctcttca 

catctctttg 

atttacccta 

gaaaactgtg 

ctgctgaaga 

aacatgagtc 

tccaaacttg 

tacaccatct 

ccaggctctt 

ctccacccag 

accaagctgt 

cattttattg 

tctaaagtta 

accctgagcc 

tacagtcacc 

gctatttgtt: 

ggccctcttg 

gcattgagtg 



60 

120 

180 

240 - 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 . 

1020. 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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gtggcagtgc 
cagcagtcca 
agtggctgcg 
tgcagatttt 
ttagaactga 
ttgagaatga 
aggctgaatt 

* atgtcataac 
tggagaacaa 
ccaagcagat 
ccctcctgct 
tggcactgga 
ttctttccct 
cacgtattgt 

^tgggcagaca 
tgagggggac 
gggtgtgagg 
tgttcccaga 
ataaaactgt 



tgaccttggt 
cgccagtgcg 
gcctttctgt 
ggggcaatca 
agccattctc 
agagaaccgc 
tcagcacttg 
caataatcca 
g^aaggatcg 
gctgcctgca 
tccatctctg 
gcaaatcacg 
gctcetggat 
tgaccaectc 
.cctgcgggag 
tcaccaggcc 
gccacctgtg 
agcaaca^tgt 
atggaaacga 



gggccaaggg 

gacaagggtg 

gctgatgacg 

tttcacctga 

aaagcctcct 

tactgtctat 

gttttacttt 

tgggtga^gac 

gggaatgaag 

gagggtgtga 

aaacttctcc 

gcagtcacta 

gccaagctgc 

ttggctagcc 

gccggccatg 

ttca(gaacct 

gccctgctccv 

gtatctgccg 

tgaaaaaaaa 



acccactgaa 

aggagctggt 

cctggccggt 

ctgaggagga 

ggccccagag 

tcatggaact 

tgcaagct^tg 

tagctacagt 

ttitgaaaat 

aggagctgtg 

tc^agagccg 

cggtgaatga 

tggtgaagtg 

tccagcaagg 

aagccgaagc 

tcagtacagc 

ttagcagaaa 

attgttctcc 

aa 



ggtcctggaa 
ttcacccgag 
gcggccccgc 
cagcaagctc 
acaggtagac 
cctggaatct 
gccacctatg 
gatgctaacc 
gtgtcgctct 
tctgctgctg 
agatgagcat 
ttdcaiattgt 
tgtctccact 
gcgctgggat 
cgggtctctc 
cctccgcgca 
aagcatctgg 
acggttccaa 



ggtgttgttg 
gacctgctgg 
attcacgtgc 
ctcgtgttct 
atagctgaca 
agt caeca eg 
aaaagtVaat 
agatgtacga 
ttgtataaca 
cttaaccagt 
ctgeacgaga 
gaccaagaac 
cccttctatc 
gcagaggagc 
cttctggccg 
gcacagcact 
agttgaatgc 
caaattgeaa 



3120 
3180 
*3240 
3300 
,3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 v 
384 0 . 
3900 
3960 
4020 
4080 
4140 
4172 i 



<210> 92 . 

<211> 4037 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2259032CB1 



<400> 92 

teggagtgee gcccgcggcc ccgagtcggt 
egtaggeget gcgccctcgg ccgggcccat 
ctttgcctcc accagcgccc tggcctccgc 
gccgcgacgc ccggcccgca gggcaggtaa 
aaattttctt aaagctgeat atactccaag 
ctgaatacat gatttaaaca agagatttcc 
ggaagtttga cttgaacacc acgtcccatg 
cgctgcagga gttaatggat gaagatgaca 
agctgctgga cttcctgtgc aggcagcagt 
aggatccgcc cctggacatg gaggagaagg 
agcttctgac ttgtgatgtg ccgcagatca 
tgagcctcct gtacgacttc ttggaccatg 
ttttcagcaa gaccattggc aatctcattg 
tgaagaagaa ggacaagttc ateagectgg 
tggacctget gctgcgcctg gtcagctgtg 
tgcactggct gaatgaagag aaggtcatcc 
aggatgaaga taggcagtca aatgettetc 
gagaccaggg cagtcagctg caagaggctc 
agtccaggca ggactgtgtg gagcagcttc 
agagctgect cgtcagtggg actcaggtgt 
ggacagaggg cttggtggac tccttttctc 
geagegtact acacggcatc gagcctcggc 
cgcccaagaa gaaagegate ctgaccacca 
cccgtctgca tggcgcccgc ctcatggcag 
accaggagct ctgccggctc aacacgatgg 
cctggaataa etttttgeae ttccaagtgg 
ctgcccgtga ggagaggaca gaagccagcg 
agaaegggaa ccggagcctg gagactcccc 
tggtgaccca cctgttccag aagtgctgcc 
ccaacgacca cacgcaggca gcgggtggca 
ggatcgccaa cgcggtggtg cagaacctgg 
aggtcatccg agggctccct gcggactgcc 
cgctgacgga gacgaaccgc aggaacactg 



ctcgagccgc cggccggccg tgccggtgtc 60 
gtgtgtgcgg cccgcccgag gccgcccggg 120 
tcgggcctcc acacgggcct ccgaagagct 180 
agagattata aatcttccac tgaatgaaaa 240 
aaaaaaacca caaatgtttt cctgttttgc 300 
acagaagctc tgcggccgtc acgatgttct 3 60 
ttgacaagct gctggacaag gagcatgtga 42 0 
tettgeagga gtgtaaggct cagaaccaga 480 
gcatggagga gctggtgagc ctcatcacac 540 
tccgcttcaa atatccaaac acagcctgtg 600 
gcgaccgcct cggtggggac gagagectge 660 
agccgcctct caatcctctg ctcgccagtt 720 
caagaaaaac cgaacaggtg attacgtttt 780 
tgttgaagca catcggcacc teagegctta 840 
tggagecage cgggctccgg caggacgtcc 9 00 
agaggcttgt ggagttgatc cacccgagcc 960 
agactctctg tgacatagtt aggctgggca 1020 
tggagecaga cccgctcctc acagegctgg 1080 
tgaagaacat gtttgatgga gaeeggaegg 1140 
tactcacctt gctggaaacc aggcgggttg 1200 
agggactgga aaggtcatac gctgtcagca 1260 
tgaaggactt ccaccagctc ctgctcaacc 1320 
ttggtgtgct ggaggagece ctggggaatg 1380 
cactgctgca cacaaacaca cccagcatca 1440 
acttactget ggacttgttc tttaagtaca 1500 
aactatgeat ageegctatt ctctcccacg 1560 
gatccgagag cagggtggag cctccgcatg 162 0 
agccggccgc cagcctccct gacaacacaa 1680 
tggtgcagag gatcctggag gcctgggaag 1740 
tgagacgtgg gaacatgggc cacctcacac 1800 
agcggggccc tgtgcagacg cacatcagcg 1860 
gtggccgctg ggagagcttc gtggaggaga 1920 
tggacctggc cttctctgac taccagatcc 1980 
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agcagatgac agccaacttc. gtggatcagt ttggcttcaa tgatgaggag tttgccgacc 2040 
aggacgacaa catcaatgcc ccgtttgaca ggatcgcaga gatcaacttc aacatcgacg 2100 
ctgacgagga cagtcccagc gcagctctgt ttgaggcctg ctgcagtgac cgcatccagc 2160 
cctttgatga tgatgaggac gaggacatct gggaggacag tgacactcgc tgtgctgccc 2220 
gggtgatggc cagacccagg tttggagccc cccatgcttc agagagttgc tcaaagaatg 2280 
gcccagagcg tggaggccag gatgggaagg cgagcttgga agcacacaga gatgcacctg 2340 
gggcaggtgc cccaccggcc cccgggaaga aggaagcccc ccctgtggag ggtgactcag 2400., 
aagcaggcgc catgtggacg gcagtgtttg atgagccagc gaactcaacg cccacagccc 2460 
caggagtggt gagggacgtg ggttccagtg tg.tgggcagc tggcacctca gctccagagg 2520 
agaaaggetg ggccaagttc actgacttcc aacctttctg ctgctccgag tcagggccca 2580 
ggtgcagctc tccggtggac acagaatgca gccatgctga gggcagccgg agccaaggcc 2 640 
ctgagaaagc cttcagcccg gcttctccat gtgcctggaa cgtgtgtgtc accaggaagg 2700 
cccccctgct ggcctctgac agtagctcct ctgggggctc ccacagcgag gatggcgacc 2760 
agaaggcagc gagtgccatg gatgcggtga gcaggggtcc cggccgggag gcccccccgc 2820 
tgcccacagt ggccaggaca gaggaggctg tcggcagggt cgggtgtgct gacagccggc 2880 
tgttaagccc tgcctgcccc gcgccaaagg aagtgactgc tgccccagcc gtggctgtgc 2940 
cccccgaggc tactgtggcc atcaccacag cactgagcaa ggctggcccc gccataccca 3000 
-ccccagcagt ctcttctgca ctggccgtgg cggtccccctr agggcccatc atggcagtca 30 60 
cagcagcccc agccatggtg gccaccctgg ggacagtgac aaaggacggg aagacagatg 3120 
ccccgccaga aggagbtgcc ttaaatggcc cagtgtgatg ctgctgccgc ccggccacgg 3180 
cccaccctgg tcaggctgcc tccttaatcg agaaaactac ctggtgatgc ,aatctttttt 3240 
tttttaattt aatttaattt taaaataaat gctgcattgg taaagctggc agttgaaacc 3300 
agttggacgg cccagcttgc gtctcttctg cctgagtggg cctctcaggt cactcgtgcc 3 3 60 
ctgctggagg acagaggggc acctcagccg cccccaagcc cagagcacag caataaggtc 3420 
ggcctgcagg agccggggtg ggggtggggg tggggggggc aggaccctga gatgccacca 3 4 80 
ggacctgatg ggccaggaag ggcgtggaca tggaggctgt ttttacagtt tttttttgtt 3 540 
gttgttttgt ttttaaagaa tacagaagga gccaagcttt tttgcacttt gtatccagct 3600 
gcaagctcag ggcagagtca agggcctggg ttggaaaaac ctgactcaca ggaatgcata 3660 
attgaccctt gcagctaccc aatagccctt ggagctggca ctgaaccagg ctgcaagatt 3720 
tgactgcctt aaaaacacaa ggccctctag gcctggcagg gatgtccctg tgcccagcac 3780 
agggtgcctg gcagggggag accacaggta tgcaggtggg gggacatggt gtggcactgg 3840 
gggctcgaag actggtttct agcactaccg gtcacggcca tgtcgtccta gaagggtcca 3900 
gaagattatt ttacgttgag tccattttta atgttctgat cacctgacag ggcaccccaa 3960 
acccccaact cccaataaaa gccgtgacgt tcggacaaaa aaaaaaaaaa aaaaaaaaaa 4020 
aaaaaaaaaa aaaaaaa 403 7 

<210> 93 

<211> 2031 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte JED No: 2359526CB1 

<400> 93 

ggcggggctc ggctcgggct ccgcgggcgg gcgggcggac atggcggcca acatgtaccg 60 
ggtcggagac tatgtctact ttgagaattc ctccagcaac ccatacctaa taagaaggat 120 
agaagaactc aacaagactg caagtggcaa cgtggaagca aaagtagtat gcttttatag 180 
acgacgtgat atttccaaca cacttataat gctcgcagat aagcatgcta aagaaattga 240 
ggaagaatct gaaacaacag ttgaggctga cttgaccgat aagcagaaac atcagttgaa 300 
acatagggaa ctctttttgt cacgccagta tgaatctctg cccgcaacac atatcagggg 3 60 
aaagtgcagt gttgcccttc tgaatgagac agaatcagta ttgtcatatc ttgataagga 420 
ggataccttc ttctactcat tggtctatga cccctcattg aaaacactat tagctgacaa 480 
aggtgaaatc agagtgggac ctagatatca agcagacatt ccagaaatgc tgttagaagg 540 
agaatcagat gagagggaac aatcaaaatt ggaagttaaa gtttgggatc caaatagccc 600 
acttacggat cgacagattg accagttttt agttgtagca cgtgctgttg ggacattcgc 660 
cagagccctg gattgcagca gttctgtgag. gcagcctagt ttgcatatga gtgctgctgc 720 
agcttcccga gacatcacct "tgtttcacgc tatggataca ttgtatagac acagctatga 7 80 
tttgagcagt gccattagtg tcttagtacc actcggagga cctgttttat gcagagatga 840 
aatggaggaa tggtcagcct ctgaagctag cttatttgaa gaggcactgg aaaaatatgg 900 
caaagacttc aatgacatac ggcaagattt tcttccttgg aaatcattga ctagcatcat 960 
tgaatattat tacatgtgga aaactactga cagatatgtg caacagaaac gtctaaaagc 1020 



81/93 



WO 01/07471 



PCT/US00/19948 



agcagaagct 

caaccaaata 
. gttccagcct 

tcaccagtgg 

gctttattgg 

atctcctagc 

gcagggaatg 
■ tttcttcctt 

ccggctgcgg 

agaatgtaag 

tttccxctct 

cagcctttta 

caacctggaa 

gcttagcaaa 

tgcagagccc 

gatgctgtta 

ttraggtttaa 

<210> 94 

<211> 820 , * 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2456494CB1 



gagagtaaac 
tccactagta 
cagaatcctc 
tattcttggg 
aaaaaatatg 
ccaactacag 
ccagtccgaa 
catactacat 
caggcagcaa 
atgcttttaa 
ttcctttttt 
tttggtttat, 
aaaaaaaaat 
agattggtgg 
ttattgcagc 
taatgagact 
ataaaatttg 



tgaaacaagt 
atgggaagcc 
tcttagggag 
gcccacctaa 
gaggcttgaa 
aggaccctcg 
acactgggag 
atttcacaaa 
gacggccgtt 
attcttaacc 
ttttgtttgt-i 
tttttaacat 
caaaacattg 
gaggtgatcc 
acttttacct 
ccataatcga 
ccaatttaca 



atatatccca 
tggtgctgtg 
agcctgtgag 
tdtgcagtgt 
aatgcpcacc 
tgttagaagt 
tccaaagtct 
anttgctcgt 
tgttgctatt 
ttatatgttg 
ttgtttgcaa 
tgtttttgtg 
"aaacttctgt 
tattcpatgg 
tttaggtagt 
gacagtacag 
ctaaaaaaaa 



acctacagca 
aatggagctg 
agctgctatg 
agattatgtg 
cag'freagaag 
cacgtgtccc 
gcagtgaaga 
caggtctgca' 
aattatgctg 
tgcttctgac 
taaacataag 
"tgctgecatt 
actctttacc 
ggttttgtga 
gccacaatgt 
tccagtctta 
aaaaaaaaaa 



aaccaaatcc 
tggggaccac 
ctacfacagtc 
caat'ttgttg 
aagagaag^t 
gccagfgccat 
cccgccaagc 
1 aaaataccct 
ccattagggc 
'cattttctct 
ttcttgtgta 
.tgtatcatgc 
agagagtagt 
tggaattgcc 
aacccctaag 
catggattca 



1080 
1140 
1200 
1260 
1320 
13 80 
1440 
>1500 
1560 
1620 
1680 
174Q 
1800 
1860 
1920 
1980 
2031 



<400> 94 

aagcgcctcc 

accgagccga 

tgcacgcgct 

agctgcggct 

gcccggtgcc 

tgcagacggc 

tggcattcct 

agatggatag 

ttggctggga 

tcgaccctcg 

caccccgcca 

cgtctgctct 

agttacctgc 

catctaccct 



gtggacacgc 
gtcctgtcct 
cctgatgcgg 
gctggtgtgc 
cttccccgaa 
gtcttacatg 
aatcagcctc 
ccccatccta 
tgacgacgaa 
ggcctcgggg 
ggagccactg 
ctctcttcat 
tgctggggtc 
ggccatcccc 



acttcctgcg 
tccaggccgt 
caccgcgccc 
gagagggcca 
acgtttaatg 
ctcgtgaacg 
ctcaccgggg 
ggtgattacc 
gacgacgacg 
gggagggccc 
ctctccccct 
ttctccgtag 
ggggctggag 
gggatccctc 



aggcctccgt 
tcgcaatggt 
tgagcatcga 
gcctgctgcg 
gcgagagctc 
agaaccgatt 
aagccgagga 
gggccttcct 
acgaagaaga 
tgcacgccgc 
tgccctccgg 
tgcttgtctt 
cctcactcac 
ccctgctaat 



<210> 95 
<211> 2070 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2668536CB1 

<220> 

<221> unsure 

<222> 2058, 2067 

<223> a, t, c, g, or other 



gcgcaccttg 
ggatgagttg 
gaacagccag 
ccaggtacgt 
ccggctcccc 
ctgcaacgac 
gtgggtggtg 
cgatgagatg 
ggaggatgat 
caccccctcc 
tccccttacc 
tgttccagga 
tcggaagtgc 



gccgagccga 
gtgctgctgc 
ctcatggaac 
ccgccgagct 
gagtttatcg 
gccatgaagg 
ccc tacatcg 
aaacagtgct 
tattaggccc 
ccgcagccct 
tacccgcgcc 
atagcgctcc 
ttggaagtgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

820 



<400> 95 

gatggccgca 

taacattttc 

aagaaaattc 

actgggataa 

ccgaggcccg 



gtcggcaagg agagacgtcg ctgaggggct 
agagaacctt ttggaaagaa caagcctact 
aagcccttca gtaacttcag caaacctgga 
gaaagacttg gctcatacac cctcacaact 
gtaccgccga gagggcgctc ggttcatctt 



tgcctgaagc gaggggattc 60 
tcaataaatg aaggagaata 120 
ccacacaaag ccatgttggt 180 
tgaaggactt gatccagcca 240 
tgatgtgggc acacgnttgg 300 



82/93 



WO 01/07471 



PCT/US00/19948 



ggctacacta tgataccctg gcaactggaa taatttattt tcatcgcttc tatatgtttc 3 60 
attccttcaa gcaattccca agatatgtga caggagcctg ttgcctcttt ctggctggga 42 0 
aagtagaaga aacaccaaaa aaatgtaaag atatcatcaa aacagctcgt agtttattaa 480 
atgatgtaca atttggccag tttggagatg acccaaagga ggaagtaatg gttctggaga 540 
gaatcttact gcagaccatc aagtttgatt tacaggtaga acatccatac cagttcctac 600 ' 
taaaatatgc aaagcaactc aaaggtgata" aaaacaaaat tcaaaagttg gttcaaatgg 660 
catggacatt tgtaaatgac agtctctgca ccaccttgtc actgcagtgg gaaccagaga 720 
tcatagcagt agcagtgatg tatctcgcag gacgtttgtg caaatttgaa atacaagaat 780 
ggacctccaa acccatgtat aggagatggt gggagcagtt tgttcaagat gtcccggtcg 840 
acgttttgga agacatctgc* caccaaatcc tggatcttta ctcac'aagga aaacaacaga 900 
tgcctcatca caccccccat cagctgcaac agcccccatc tcttcagcct acaccacaag 960 
tgccgcaagt acagcagtca cagccgtccc aaagctccga accatcccag ccccagcaga 1020 
aggaccccca gcaaccagcc cagcagcagc agccagccca acagcccaag aaaccctctc 1080 
cgcagcccag ttctccccga caggttaagc gagccgtggt tgtttctccc aaagaagaga 1140 
acaaagcagc agaaccacca ccacctaaaa ,tccccaaaat tgagaccact catccaccgt 12 00 
tgcctccagc ccacccacct ccagaccgga , agcctcccct cgctgctgcc ttaggtgagg 12 60 
ctgagccgcc gggccctgtg gatgccactg acctccccaa agtccagatt ccccctccgg 1320 
cccacccggc- ccctgtgcac cagccaccgc, cgctgccaca ccggcccccg cccccacccc 1380 
cctccagcta catgaccggg atgtccacca ccagctccta catgtctgga gagggctacc 1440 
agagcctgcar gtccatgatg aagaccgagg gaccctccta. cggtgccctg ccccccgcct 1500 
acggcccacc tgcacacctg ccctaccacc cccatgtcta cccgcccaac ccgcccccgc 1560 
cacctgtgcc tcctccccca gcctccttcc' cccacctgcc atcccacccc ctactcctgg 1620 
ctacccccaa cccgcaccca cctacaaccc caact^ccca cccccacccc cacgcctccc 1680 
gcctacccac gcagtccccc ctcatcctcq tccagggttg ggcctgccgc cagccagcta 1740 
cccacctcct gccgtccccc ctggaggaca- gcctcctgtg cccccgccca ttcccccacc 1800 
cggcatgcct ccagttggag gggctggggc gggcagcctg gatgagataa cgtgagcctt. 1860 
ttttccctct ttgttttttt aacaagattt tctaatcgac ttgcagagta gttgaagtgg 1920 
gtaagcagca gggtaccttg tataatgcac gacagttgca gtatgggaag aatggaccgg 1980 
gcccctggga taaaatcaga gtggtcctca cacctagagg acggggacaa ccagctttca 2040 
gagtagcctc atcagtgncc ttgcagnctg 2070 



<210> 96 
<211> 2046 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc__f eature 

<223> Incyte ID No: 2683225CB1 



<400> 96 

tgacatggat 

ccaccttgaa 

gttgaaacta 

gcccacctga 

agccagacta 

ctgtaaaaac 

ggggtcttgc 

accaagtaat 

agctattaaa 

aagagcaaga 

aagaagtcgc 

cccacagagc 

tcagccagca 

tggatgcctt 

acgtaagttg 

gtgagaacag 

tttttcttcc 

acaagtgttt 

tttaatctgt 

tttcacccct 

ccttgttggc 

tttcattcct 



ggggatgaet 
tgcccatgat 
attttgtgga 
ggatattgac 
catagaagat 
ctcccggaac 
tcctcagaac 
aaagcagaag 
acggcagcag 
aaatgccccc 
ccaagcccag 
tgtctgtggg 
gagaattttg 
gcctgtgaca 
gaatgatgga 
gaaaaaggaa 
attccaagtt 
aattcctctt 
tcactcactg 
gttgcagtta 
acttaattta 
tttggttctc 



tcctgggcta 
taagacataa 
tagagaaact 
cataaggact 
gacaatcctg 
catcaagatc 
aaaccagaat 
gaagaagaag 
aagttggagc 
gagtttgtga 
gagtcctagg 
tgccttctca 
acttgcagta 
gttgcagttc 
cctgt.gt.tca 
gaggacactg 
tcgtgggacc 
tctgttatct 
taaattttga 
acaccagaga 
gaaatagggt 
tgtgtaataa 



acacttctct 
acaagaaaag 
gttggaactg 
catatctcat 
aactcattag 
ttcacagaga 
tgcagaaggt 
cacagaagaa 
agcttgaac t 
aggtgaaagg 
ctgaggctgc 
atctcagggc 
ggatttggtt 
ctattcgcca 
gagacttaac 
gaatcaattc 
cagagccttt 
gttagataat 
ggacccagaa 
aggaacgtga 
gagaagttta 
tagcaggcta 



tgctaaagga 
agatatctga 
ctttacaaag 
tacaagaagc 
gcctcagaaa 
acttcttatg 
gatggaaaaa 
gaaatctgac 
tgagaagcag 
caatctcagg 
accaagacct 
aaaagcccct 
tgattttcct 
aatgaagggc 
agaaccaaca 
ttgagagttg 
tttcttttaa 
tgagatcacc 
ttgtcttgtt 
atgtcgagca 
aaagcccatc 
catagtgaca 



tgattttgca 
ggagaatgca 
tattttaaca 
atcatggccg 
ctgatcaatc 
aatcaaaaaa 
agaaaacgag 
ttggaaatag 
aaattgcaag 
agaacaggcc 
cgtgtgtcac 
ggagaatatt 
acgatctggg 
agtgccccgc 
agcaaaacaa 
cactacttgg 
aagctaaaaa 
tagaaatgcg 
taatttatac 
cagccactac 
ttgattttat 
ttccagttcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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aagaaggtac atcctgtcca ttcattaatt gctxtgatta ctaggagggt ttctgttcag 13 80 
ttttgtttf t aaatgtcttg ctgatctagt tctttcagat ggaataacct tccagtccct 1440 
tagagagtgg aactagtcca tataacccag ,cttcagt:agc aaaagtagaa gccgccacat 150a 
cttttcattt ctccaagagg agagtgggga aggttcccat gaccagctgg gcagtcagga 15 60 
tttctctagg cattctaat,g tgaaataagt gtagactgct gtcaaggagg cttcatcaga 1620 
agatgtatag catttgaa^g tctaatgata atgcatatca jttagaatcca agctttgaaa * 16.80 
atttctgatt aatgctrcatg tatttcttta tctttgtittt tccttgtgaa gaaagacttt 1740 
caccactgtc tgagtgatga f tgctgttgat aaggatgatg tcgatgacta ctatattgca 1800* 
tctctcagga acagctgatg ggaagggagg ggctgctgag ttcccttgtt ctagctagca 1860 
gcacgctcct caga-gagggg gccgagttac agacagcagc cgcattctca tgcaaaatta 1920 
gttt;taaacfe gctagtgtgg gcatcggtac ctttpgccfcg ggtgataccg aagaattgtt 198 0 
gaggatttag tatgctccgt agagacagtt catfccagtca tttctgcatt ggagagactt 2040 
CtCata ,- ' * ' . r . r 1 , i 2046 

<210> 97 . ; > ^ ' . , 

<211> 2660 
<212> DNA 
<213> Homo sapiens 



<220> - ' i " ■ i 

<221> misc_feature £ ■ 

<223> Incyte ID No: 2797839CB1 ^ 

< ■ 

<400> 97 

gtggcgagtg ccggccgaaa gctaggtccg gattgcacgt ggagggccgc ccgaagggca 60 
ctctcggaca ttaacccgca ttctgtacca tggggcgcaa gttggaccct acgaaggaga 12 0 
agcgggggcc aggccgaaag gcccggaagc agaagggtgc cgagacagaa ctcgtcagat 180 
ccttgcctgc agtaagtgac gaaaattcca agaggctgtc Cagtcgtgct cgaaagaggg 240 
cagccaagag gagattgggc tctgtcgaag cccctaagac aaataagtct cctgaggcca 300 
aaccattgcc tggaaagcta ccaaaaggga tctctgcagg agctgtccag acagctggta 3 60 
agaagggacc ccagtcccta tttaatgctc ctcgaggcaa gaagcgccca gcacctggca 420 
gtgatgagga agaggaggag gaagactctg aagaagatgg tatggtgaac cacggggacc 480 
tctggggctc cgaggacgat gctgatacgg tagatgacta tggagctgac tccaactctg 540 
aggatgagga ggaaggtgaa gcgttgctgc ccattgaaag agctgctcgg aagcagaagg 600 
cccgggaagc tgctgctggg atccagtgga gtgaagagga gaccgaggac gaggaggaag 660 
agaaagaagt gacccctgag tcaggccccc caaaggtgga agaggcagat gggggcctgc 720 
agatcaatgt ggatgaggaa ccatttgtgc tgccccctgc tggggagatg gagcaggatg 780 
cccaggctcc agacctgcaa cgagttcaca agcggatcca ggacattgtg ggaattctgc 840 
gtgattttgg ggctcagcgg gaggaagggc ggtctcgttc tgaatacctg aaccggctca 900 
agaaggatct ggccatttac tactcctatg gagacttcct gcttggcaag ctcatggacc 960 
tcttccctct gtctgagctg gtggagttct tagaagctaa tgaggtgcct cggcccgtca 1020 
ccctccggac caataccttg aaaacccgac gccgagacct tgcacaggct ctaatcaatc 1080 
gtggggttaa cctggatccc ctgggcaagt ggtcaaagac tggactagtg gtgtatgatt 1140 
cttctgtgcc cattggtgct acccccgagt acctggctgg gcactacatg ctgcagggag 1200 
cctccagcat gttgcccgtc atggccttgg caccccagga acatgagcgg atcctggaca 1260 
tgtgttgtgc ccctggagga aagaccagct acatggccca gctgatgaag aacacgggtg 1320 
tgatccttgc caatgacgcc aatgctgagc ggctcaagag tgttgtgggc aacttgcatc 1380 
ggctgggagt caccaacacc attatcagcc actatgatgg gcgccagttc cccaaggtgg 1440 
tggggggctt tgaccgagta ctgctggatg ctccctgcag tggcactggg gtcatctcca 1500 
aggatccagc cgtgaagact aacaaggatg agaaggacat cctgcgctgt gctcacctcc 1560 
agaaggagtt gctcctgagt gctattgact ctgtcaatgc gacctccaag acaggaggct 1620 
acctggttta ctgcacctgt tctatcacag tagaagagaa tgagtgggtg gtagactatg 1680 
ctctgaaaaa gaggaatgtg cgactggtgc ccacgggcct agactttggc caggaaggtt 1740 
ttacccgctt tcgagaaagg cgcttccacc ccagtccgcg ttctacccga cgcttctacc 1800 
ctcataccca caatatggat gggttcttca ttgccaagtt caagaaattt tccaattcta 1860 
tccctcagtc ccagacagga aattctgaaa cagccacacc tacaaatgta gacttgcctc 1920 
aggtcatccc caagtctgag aacagcagcc agccagccaa gaaagccaag ggggctgcaa 1980 
agacaaagca gcagctgcag aaacagcaac atcccaagaa ggcctccttc cagaagctga 2040 
atggcatctc caaaggggca gactcagaat tgtccactgt accttctgtc acaaagaccc 2100 
aagcttcctc cagcttccag gatagcagtc agccagctgg aaaagccgaa gggatcaggg 2160 
agccaaaggt gactgggaag ctaaagcaac gatcacctaa attacagtcc tccaagaaag 222 0 
ttgctttcct caggcagaat gcccctccca agggcacaga cacacaaaca ccggctgtgt 2280 
tatccccatc caagactcag gccaccctga aacctaagga ccatcatcag ccccttggaa 2340 
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gggccaaggg 
agaaacagaa 
gccgcccccc 
tatqttagat 
cttgcctctg 
aaaacetctg 



ggttgagaag 
tgataccccc 
accagcaaag 
ggttgaaaac 
tgaggatgcc 
aaaaaaaaaa 



cagcagttgc 
aaggggcctc 
aggaagaaat 
tagacpggtg 
ttctctactg 



cagagcagcc 
agcctcccac 
ctcagtccag 
gctcactgcc 
tgcataccca 



ttttgagaaa 
tgtgtctccc 
gggcaacagc 
attgtcacca 
tgaaatttaa 



gctgccttcc 2400 
atccgttcca 2460 
cagctgctgc 2520 
ggttggaaet -2580 
tacacatttt 2,640 
2660 



<210> 98 = 
<211> 4610 
<212> DNA 

<213>- Homo sapiens 



<220> " i 

<221> misc.feature 

<223> Incyte ID No: 2959521CB1 



<4O0> 98 

ggggcccgga 

gc.ttctgccg 

ctccgagcgc 

gagctgcctc 

atccctgggt 

tggcgagggt 

gggagtgggc 

ctttagttct 

cagcgagctc 

ctcggagaac 

tgaagtgtgg 

ggccagcgtg 

agctctggcg 

tggtttcaac 

ttatcttttc 

tcatcatgga 

gggggtgatg 

cagtgagctg 

tgtgcaggac 

tgatatcggc 

tggcctggcc 

ggccatcctc 

tgggatgagc 

gggtctggcc 

agctggagga 

ctgtcaagag 

cctcctgact 

tgggtagggc 

ccagcccgag 
acacagactg 
taagctgccc 
ctctcagtga 
ttaccttggc 
caaaaataga 
ggggcccatg 
gacaggaagc 
gaacctgaag 
gagaggaagt 
tgctggatat 
tggcggtgtg 
gctgtctgca 
tcttctaaga 
tgtgttagag 
caagctgact 
atggaatcag 
tagggggtct 



cgcatgaggg 

ccgccggccg 

caggtttatt 

ctgctgaccc 

gcaccagcta 

gacagcctca 

cggggtaatg 

ggagacctct 

caggagttct 

caggaaaacg 

ggatacggtc 

gtgcccttca 

attggaaccc 

cctctggaag 

tttttcacag 

cacagccatt 

gagaagctgc 

gacggcaagg 

ctgcaggctt 

actctggccc 

atcggtgctt 

tgtgaggagt 

atccaacaag 

tttggcatcc 

atgttcttgt 

gatgaaagga 

ggattcacca 

tctgccaaga 

gacttaccat 

tattcctgca 

tggtaaccag 

ctctggaacc 

ctcctcttgg 

gccaatggtt 

actctagctg 

cttcccattt 

tcagaacaca 

cgaaccactg 

tgattttgta 

gctgcctgga 

agaaaacagt 

gactttgtag 

gggagactgc 

tgtagaaatg 

tgcaggcaaa 

ctgaaatgta 



gggtcggcgc 
cctgggacct 
cagtcaccat 
tgcttggctt 
tcagcgctgc 
ctctgcagca 
tcacccagca 
tcactgccca 
gccccaccat 
aggagaatga 
tcctctgtgt 
tgaagaagac 
tctactccaa 
attattatgt 
agaagatctt 
atgcctctga 
agaacgggga 
cgcccatggt 
cccagagtgc 
ggatgatcac 
ccttcactgt 
tcccacatga 
ctctcttctt 
tggccggcag 
atatttctct 
agggcagcat 
tcatggtggt 
gcctgtggga 
ccacaatgca 
ttcaaatgtc 
tctctagcta 
tgaatgcagc 
aaccagtgct 
ataacttggc 
ggcaccttgg 
tttcaaagtc 
tgagcagggt 
ctgtgtgtct 
acagcacccg 
cctcctcttt 
tggtttggct 
ctgcctccta 
acaaactatc 
gggtgccat t 
atttaggatt 
agggcacaaa 



gcgtgtctac 

tgcggtgagg 

gaagctgctg 

atggagaacc 

ctccttcctg 

gctgaaggcc 

cgtgcaagga 

caatttcagc 

cctccagcag 

gcagacggag 

gaccgtcatc 

cttttacaag 

cgccctcttc 

ctccaagtct 

gaagattctt 

gtcgcttccc 

cctggaccac 

ggacgagaag 

ttgctactgg 

tctgagcgac 

gtcagttttc 

gctaggagac 

caacttcctt 

ccacttctct 

ggctgatatg 

cttgattcca 

cctcaccatg 

ctggaagtcg 

ccacggaaga 

agccgtttgt 

gtgcctcttg 

ttacaagaca 

gaaaggtttt 

tagaaatatc 

acctccagct 

tgtttaattg 

gagaggtgag 

tgtcaggatg 

gtgtttcacg 

caggttaacg 

gtgattttga 

gaagcacatt 

ctcccccagg 

tatgctctac 

tgccgcttcc 

cttcacttag 



gcggacgcac 

ctgcgcgggg 

ctgctgcacc 

acccctgagg 

caggatctaa 

ctgctcaacc 

cacaggaacc 

gagcagtcgc 

ctggattccc 

gaggggcggc 

tccctctgct 

aggctgctgc 

cagctcatcc 

gcagtggtgt 

cttaagcaga 

tccaagaagg 

atgattcctc 

gtcattgtgg 

ctgaaaggtg 

ggcctccata 

caaggcatca 

tttgtcatcc 

tctgcctgct 

gccaactgga 

ttccctgaga 

tttatcatcc 

tattcaggac 

ggccctgggc 

ggccgttcta 

aaaatgctgt 

ccctctcctc 

agcctgactt 

gaatccttta 

aagagttgaa 

ggccaataga 

cc tattactt 

gcaaggttca 

ctcacttgtt 

gctgtccgag 

ctgacagaat 

cctcctcttc 

ctgagcacat 

ttgagacgtc 

ttagacaagg 

ataaatcaaa 

ggcatcgcag 



cggctaagct 

ccgaggccgc 

cggccttcca 

ctcacgcttc 

tacatcggta 

acctggatgt 

tctccacgtg 

ggattgggag 

gggcctgcac 

caagcgctgt 

ccctcctggg 

tc tact teat 

cggaggcatt 

ttgggggctt 

aaaatgagca 

accaggagga 

agcactgcag 

gctcgctctc 

tccgctactc 

atttcatcga 

gcacctcggt 

tgctcaacgc 

gctgctacct 

tttttgcgct 

tgaatgaggt 

agaacctggg 

agatccagat 

tgcccgatcg 

tgaaaaactg 

atcctaggaa 

acctcctttt 

ttttctctga 

cccaacaatg 

tccatagtgt 

agagacagga 

ctctcaaaga 

tcctgaatgg 

cctactgaga 

tgagctaacg 

ggaggctcag 

cccactgcca 

ttgagacctc 

tgcagagtgg 

gtaatcagaa 

gcatgactaa 

atgtttgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140- 

1200. 

1260 

1320= 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

27 60 
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aatggttggc 
" ttccttgcaa - 
cagaaaatgg 
gcccttgcct 
- gatttattat 
tccagaagcc 
attcatgtgc 
catagaaaaa 
gggagacatc 
gtgatacttt 
acgctgaaga 
ctggtgggag 
aactttttat 
atttttctca 
agattttgct 
tagatcctat 
gttagtggat 
atgaaatagg 
atagacagct 
tagtcggagc 
gtggcaacag 
gaatttgctg 
aggacttttt 
tattaataag 
ggttctgctc 
ggtattttgt 
taaaaagaag 
aagagatggc 
aggttaggat 
ttttaaataa 
taaatgtttt 



ccaatgatta 

aaaaagtcga 

atggagataa 

ccgagctctg 

tatcatattq 

tttcttag££ 

atggctgaca- 

ctgtccgctfc 

tgtcaaacca 

fc'tttaagttt 

gggaaaattt 

gattatggtc 

attgtcatta 

agctgctttc 

ttaacctaac 

cattccttaa 

ttctagttta 

gtgttttccc 

gcctcaaagg 

ttaggagggg 

ctttgcctca 

tttcaagcat 

cacccctggc 

ttagcaagga 

cttctcactt 

gtctagtgtc 

gcttctctgt 

aatatgctgg 

tttattgttc 

ctcaggtgta 

atttgtaaga 



tgctacagat 
atcctgcatt 
cttgtcttta 
gcttcaaggg 
ataatgtgag 
gtgcccacag 
ggagtactgg 
tcagtaatca 
ggaatattct 
tgttrtttatc 
cagtgatgga 
'cagttttacc 
ttattgttgt 
aggagctagc 
tcgcattgat 
acataatacc 
ggagaggagc 
tttttgtgca 
gaaatcctct 
cggagacgct 
tgagtcaaaa 
ttgtacatat 
attagcagct 
aagtgtatgt 
caccaccggc 
aaattggagc 
ttgggtagcg 
gcatctactt 
ttatttccct 
tgagaagaaa 
ttctataaat 



( 

pggttttaaa. 
' gaattgaata 
aaactgtagg 
gagctcttct 
attctttagc 
ttggagccca.« 
ttcactacca 
caagcagcat 
tgaaaagaac 
ttgicctg'ttg 
gattctagat 
aaagaaccaa 
ttttaaacgg 
'agaaaataac 
gtattaaatt 
fctttgtcttg 
tcaaaactat 
cacctatatt 
ttaaaccgta 
cacatcgtct 
attggcaatt 
tagaagtcta 
tcgacctcat 
cacgtgcagg 
acacagcttg 
tattcttcac 
taagagctga 
taaaacaaag 
ttacagttct 
ttagaaaaga 
aaagctatat 



tgacccgtct 
tgaatttctc 
ccagccttag 
ccaggttcac 
1 cactttgggg 

» ggggccatgt 

atgcctgagc 
ccgttttgtt 
gtgagcagga 
gcttcaatac 
taaatatcag 
ttccttgaat 
ttctttgtct 
tcaaagttga 
tataatttta 
gagtagaata 
aatctttaac 
accttaagaa 
gttggcgcag 
gacttgagtc 
tcttttgatt 
aggagtagca 
tttccagatg 
aacagtgagg 
cccctgtctt 
tggtccttaa 
gtatagtaag 
ttgtctgatt 
gcagttccat 
aaattaactt 
tctgtaaaac 



aggttactgc 
taactctctc 
ccactgtgga 
taggtgaatt 
agcctgtctc 
ttgcaaactg 
ttttctctta 
. t'tctcttctt 
aaaactgctg 
atttgagaat 
gactgatttc 
gttrggaatct 
tttctgtttit 
agactctgga 
gcattcccaa 
ctaagttaga 
aaattgaaaa 
att^ccttcc 
aggtcagtcc 
gccactigatt 
tttagttgtt 
agtcagtggg 
caccagctcc 
cagggacagg 
tgcccccaaa 
ccttgggttt 
tcctcttcca 
tttgcaagag 
cacagtattt 
atgtggactg* 



2820 
2 880 
2940 
. 3 000 
3060 
/ 3120 
- 3180 
32^0 
3360 
3360 
3420 
3480 
3540 
3600 
lv 3660 
3720 
3780 
3840 
3900 
3960 
4020 
4086 
4140 
4200 
4260 
432G 
4380 
4440 
4500 
4560 
4610 



<210> 99 

<211> 1889 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3082014CB1 

<220> 

<221> unsure 

<222> 1809, 1848, 1853, 1880 
<223> a, t, c, g, or other 



<400> 99 

acgtagaaga tggagagaag gaggtggtat 
ctcaccgggg ttttccaatc tctctcctat 
gagacagata tcttgagcca gtacctggaa 
tttgaccccg tgcagaagac acttctcgct 
taccaagctg agatcccaga tcgcctagta 
aagatggaga tgaaggtctg ggacccagac 
tttcttgtgg tggcccgagc tgtgggaacc 
attcggcagc caagcttgca catgagtgca 
cacgccatgg ataccttgca aaggaacggc 
gtaccccagg gaggcccggt gctgtgtcgg 
gccatgctat ttgaggaggc cctagagaag 
gattttctac cctggaagtc acttgccagc 
acagaccggt atattcagca gaaaaggttg 
caggtctaca ttcccaccta cactaagcca 
aaacctggca tgaatggggc tggatttcag 
acacagtctg ctcagtggta tgcctggggc 



tgagagatgc tgtggggagg gagagtgggg 60 
agggggaaat gcagtgtgac cctcttgaat 120 
aaggaggact gcttttttta ctcactggtg 180 
gatcagggcg agattagagt tggttgcaaa 240 
gagggagaat ctgataatcg gaaccagcag 300 
aaccctctca cagaccggca gatcgaccag 3 60 
tttgcaagag ccctagattg tagcagctcc 420 
gctgctgcct cccgagatat cactctgttt 480 
tacgacctgg ctaaggccat gtcgaccctg 540 
gatgagatgg aggaatggtc agcctcagag 600 
tatgggaagg acttcaatga tattcgccag 660 
atagtccagt tttattacat gtggaaaacc 720 
aaagctgctg aagcagacag caaactgaaa 780 
aaccctaacc agatcatttc tgtgggttca 840 
aagggcctga cttgtgagag ttgccacacc 900 
ccacctaaca tgcagtgccg cctctgtgct 960 
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tcctgttgga 

gccactcggg 

agtctctctc 

actttcctgc 

ttacagccaa 

gcagagtgct 

gtgcggctgc 

aaaacacctc 

ctggggggca" 

tctgccaatg 

cgtcagaaac 

accagggccc 

aacttgcccc 

gcaccctcac 

ggaaaggang 

tttcgtgtcg 



tctactggaa 
gcaccacgga 
cttacacaac 
ttcagaccac 
ggagggccgc, 
ccattcgact 
ccctggcaac 
ggggtaccaa 
ttatggtgaa 
gaaggcctct 
taaacccagc 
tacggaaggc 
tgaaggtgaa 
atcctgccag 
tgggctgaga 
aa^ggagtgan 



gaagtatggg 
gccacactca 
iCagcgccaac 
aaagctgacc 
ccgacggcct 
tcctaaggcc 
tatcgtcaaa 
gacaccgatc 
acgggcctat 
ggcttcaggg 
tgatgccccc 
tctgacccat 
gccaacgctg 
caccaatgag 
aggtagaggt 
gagtgatta 



ggactgaaga 

aggggtcatt 

agggccaagc 

cgtcttgcca 

tatgctccta 

gccaagactc 

gatctggtgg 

aacagaaacc 

gagactatgg 

attcgttcaa/ 

aatcctgtgg 

ctggaaatgc 

attgcagtgc. 

cctattgtcc 

ggatgcccag 



ccccaactca 
tatccagacc 
ta^ctggctaa 
gacgcatgtg 
tcaatgccaa 
cattgaagat, 
cccaggcacc 
agctgtccca 
caggggcagg 
gcdcacagcc 
tgtttgtggc 
ggcgagctgc 
ggccccctgt 
tggaggactg 
ggcaccanaC 



gcttgagggg 
tgaagctcaa 
gaacagacaa 
cagggaccta 
tgccatcaaa 
tcaccctctg 
cctgaaacca 
gaaccgggga 
ggttcctttc 
agcagccaag 
cacaaaggat 
tcgccgaccc 
ccctctacct 
agcactgttg 
ctncccttcc 



1020 
1080 
1140 
1200 
1260 
1320 
^1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1889 



<210> 100 
<211> 2032 

<212> DNA f 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3520701CB1 



<400> 100 

gccggggccg 

gggcggcagt 

gagctggaga 
tctcagggct 
gaagaaccag 
tttgagccat 
ttggaatgca 
aacttttttg 
ccagatgaga 
gccattcccc 
tgttcttttt 
gatggattgg 
ggagagcatc 
cgggaggtta 
ctccagacag 
caggagctta 
tggcgtagca 
ttattcctga 
caggccactg 
caagtggaac 
agat ttgctg 
tttcagtcaa 
gtatgaagct 
gtgtggttca 
ttcattctct 
tgtgactcag 
acccagggtt 
cttgtttaat 
cttccagaac 
acaggtcccc 
tgctccttct 
ggcttgtttt 
atgtctgtca 
tgaattttac 



agccgctgtt 
cggagctgga 
atgaggcgtg 
cagtaaagag 
caggaatttg 
acacaaaaag 
aattacttcc 
gattgtactg 
tgatccagtg 
ctgagagtgg 
tgtgggctta 
tgcggaacat 
aagatagtac 
aagtagagca 
tgtttaagaa 
aagctaagca 
agttgtgtac 
cagatgaata 
acaggagcga 
tcatttgtga 
atgaaggcac 
aaaagagaag 
ttctcattca 
catttggccc 
ttagctgcag 
ctgatgcagc 
atttgcatag 
gatctgttat 
ctcctccgca 
tcgcccctat 
ttcacgtccc 
ctctaggccc 
caggcggaga 
taatacagtt 



cggctgacag 
gcccgtggta 
cgctgtcctg 
acaagcacta 
tttagct tgc 
aaattttcgt 
tgacaaagca 
catttgcaag 
cgtagtctgt 
ggattttcag 
tgqtgcacaa 
tgatggaata 
cctcaaagag 
gaacagtgaa 
tgaaagcctc 
gcttataaag 
ctgccaagac 
cgacacagtt 
tcccctaatg 
atacaatgat 
ggttgtnaag 
aagagtggat 
agccaatgaa 
cctttccgtc 
tagccaccgt 
tcattccaca 
tttttaagtt 
ttcactccca 
ggcatcacgg 
gattgtggtg 
tgttttctga 
aacctccaga 
gattaacaga 
caagtgaaat 



ttgaggatgg 
tcgttggtcg 
ggcggcagcg 
tatgcctgta 
agttatgaat 
tgtgattgtg 
aaggtaaatt 
agaccttatc 
gaagactggt 
gagatggtat 
ttggcagtaa 
ggtgatcagg 
gatgttccag 
ccatgtgccg 
aacgcagaat 
aaagacactg 
tgtatgaaaa 
ctggcttatg 
gataccctta 
ttgaagactg 
agagaggaca 
gggatgcagt 
aatgcgcttc 
ctcctctgtt 
gtggatgctg 
gacttcgcca 
tgattttgtt 
gatgtgtgtg 
aaggctctct 
ccttgggtca 
ggtttggtca 
gtagccagga 
tgacagggtt 
tttcgttcat 



ccggagccga 
acgtccttga 
actccgagaa 
gtacctgcac 
gtcatggaag 
gaaacagcaa 
ctggcaataa 
ctgatcctga 
tccatggaag 
gccaggcctg 
ccaaaatatc 
aagttatcaa 
aacagggaaa 
gctctagttc 
caaaatctgg 
ccacctattg 
tgtatggaga 
aaaacaaagg 
gcagcatgaa 
aacttaaaga 
ttcagcagtt 
attactgcag 
ccattcttgg 
tggagaggcc 
acttcacagc 
gtgtactcct 
ttgagaaagc 
ttttgccaca 
tcccgtcacc 
aagcttcctc 
tagcttagaa 
ctgatggttt 
gaggaagcaa 
gattctattg 



gggcgccgct 
ggaggacgag 
gtgctcctac 
cccagaggga 
tcacaaacta 
gtttaaaaat 
gtacaatgac 
agacgagatt. 
gcatcttggt 
catgaaacgt 
cactgaggat 
acctgaaaat 
ggatgatgtc 
tgaatctgat 
ctgcaaactt 
gcccctgaac 
tctagatgtc 
gaagattgcc 
tagagtccag 
ctatctcaag 
ctttgaagag 
ctagagtgga 
aataaaagag 
tcgcgctccc 
cagcgtcctc 
actccagtgc 
aaattggtgt 
gagctgttgc 
tagaacctct 
aagcctggtc 
aggatcttgg 
tctggtctgg 
gcctttgtta 
gc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2032 



<210> 101 
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<211> 1356 

<212> DNA • 

<213> Homo sapiens 

<220> 

<221> misc„feature 
<223> Incyte Ii) No: 



4184320CB1 



<400> 101 

aatgaaagca 

aggctcagaa 

ccagcaccaa 

tcacccccag 

acatccagtc 

ttgggaccat 

cttttgatac 
ttcccaagct 
tcaagcaagt 
tcactggagg 
cctctaagtc 
accaggagga 
tgagcaagct 
ccactggaaa 
tccagaagat 
cattttgggg 
cactcatagt 
cccctcccgt 
ttgggcataa 
ctaaagcctg 
cctcaaatca 
tggtgaagaa 
tacacggata 



acaggagctg 
atcctggata 
agacccccat 
tatctccgtg 
tgaaaagtgg 
tgagttccaa 
aaaacctgat 
tctcatctct 
ctttg^gaaa 
ggttaacaca 
tcgaggaaag 
cctcattgga 
cactgttctc 
atatggagca 
aaacacaaga 
tagttgccag 
ggaaggagga 
gccagtggtt 
atactcagaa 
tttggaaacg 
aaacaagcta 
cagagaaaca 
ataaaaaggg 



ctccggggac 

gaaagagcat 

aggtgttgct 

cttcagaatg 

tccatcagca 

ggaggtggcc 

ctcctcttac 

gjtccatgggg 

gggctcatca 

ggt^ttattc i 

atatgcatca 

agagatgttg 

aacagcatgc 

gaggtgaaac 

tgcctgccgt 

ttagactcag 

cccaatgtga 

gtctgtgatg 

gaaggcgggt 

agggtatgag 

tgaacaaatt 

tattctaaat 

tgaggaaaag 



tgcttttgcc 

tttataaaag.. 

gtgggcgtct 

agaaaaatga 

aacacactca 

attccaaxraa 

acctgatgaq 

gcctgcagaa 

aagcagctat 

gtcatgttgg 

taggtattgc 

tccggccata 

attcccactt 

ttcgaagaca 

ttttctctct 

ttggaatcgg 

tctcgattgt 

ggagtggacg 

aggtaacttt 

tggattatgt 

gtctaaaaaa 

gtcctgtgaa 

agagga 



agtacccaga 
agaatgtgtc 
gataggccag 
aagtdgcctc 
actcagccct 
agccatgtat 
caaggaatgg 
qtttgaactc 
gacaactgga 
cgatgccttg 
cccctgggga 
ccagaccatg 
cattctggct 
actggaaaag 
tgactcccgc 
tcaaggtgtt 
tttggagtac 
ggcatcggac 
ccaggcccca 
tttcagtagc 
tgtctgtcat 
gtgggaaatt 



atcagtgptc 
/cacatca'tac 
catgttggcc 
tcccgaaatg 
acggatgct s t 
gtgcgagta.t 
cagttggagc 
cagccaaaac 
gcgtggatat 
aaggatcatg 
attgtggaaa 
tccaafcccca 
gacaacgggfe^ 
catatttcac 
ttgttttatt 
cctgtggtgg 
cttcgagaca 
atcctggcct 
tggaagaacc 
tcaaccaaga 
gggagggctg 
ctatgaaagc 



60 
120 

;180 

240 
,300 , 
360' 
420 
480 
540 
600' 
660 
720 
780 
840 
9 OX) 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1356 



<210> 102 
<211> 580 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 4764233CB1 



<400> 102 

cacaacgcag 

ttcactgggc 

gagtgtgagg 

taggtccgca 

atcaacatat 

gcttgaaccc 

tacacctgat 

ggaagccgat 

tgatgtcaag 

agggaaatca 



gcaccgactt 
atcttccctt 
ggcacgttcc 
ggccagtcct 
aggcctaggc 
actgatgaag 
cagaagagag 
ctccaggagc 
gggaagattc 
caggtttaaa 



cagtgtgcat 
cgaccccttt 
agccgtctgg 
cccaggaact 
caagaagaag 
agcctaaaga 
aagatgatca 
tatgtcagac 
taccaaaagc 
ggaagataag 



gttccttgga 
gcccacgtgg 
actctttctc 
gaaatagtga 
tttacagcct 
agagaaacca 
gggtgcagct 
aaagactggg 
agagcacttt 
ctgaaacaac 



cacctgcctc 
tgaccgctgg 
tcctactgag 
aatatgagtt 
cctgagctga 
cccactaaaa 
gagattcaag 
gatggatgtg 
aaaatgccag 



agtgtgcatg 
ggagctgtga 
acgcagccta 
ggcgaggaag 
ttggggctat 
gtcggaatcc 
tgcctgacct 
aaggtggtac 
aagcaggtga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

580 



<210> 103 

<211> 1487 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



4817352CB1 
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<400> 103 

ccgggaggcc ggggtctcgg gtggccgccg 
aaggcgaagg tccccgcctc caagcgcgcc 
gtcaagacgc tcactcggaa gaaaaacaag 
cgggaagtaa gcaagaagcc agcaagcggc 
ccagaagact tttctcaaaa ctggaaggcg 
caggccccag aaaagcctct tgtcatctct 
atccagcaaa acaaaaaaga gacctcgcct 
aaagaccagg aggccagcag gggctctgtt 
ccagtacctc gcaccaaggc cagtggaafca 
acaaatggtg atattgttcc agaacgaggg 
gaggcagccc cagccccacc caccgaggaa 
gatatcgaag ctgccatagg tccagaggcg 
agcgagggca gcgtcagcct cagcctcgtg 
gccttagcct tggactgtga gatggtgggc 
gcccgtgtgt ccatcgtgaa ccagtatggg 
accgagcccg tgacggacta taggacagcg 
cagggagaag agcttgaagt tgttcagaag 
ctagtggggc acgctctgca taatgaccta 
aagattcggg acacacagaa atataaacct 
tctctgagac tactttcaga gaagatcctt 
tcaattcagg atgcccaggc agcaatgagg 
agcacggccc gagacaggcg ccccctgctg 
tagcagtcct gccctgctgc tgctgccgcc 
ggacagatca catctcccca gagtggcaac 
aggggcgtgg cgtggtgcta ctgagaagac 



gcccaggcgc tggacggcag caggatgggg 60 
ccgagcagcc ccgtggctaa gccgggtcct 120 
aagaaaaaaa ggttttggaa : aagcaaggcg 180 
cccggtgctg tggtgcgacc tccaaaggca 240 
ctgcaagagt ggctgctgaa acaaaaatct 300 
cagatgggtt ccaaaaagaa gcccaaaatt 360 
caagtgaagg gagaggagat gccggcagga -42 0 
ccttcaggtt ccaagatgga caggagggcg 4 80 
gagcacaata agaaaggaacf caaggaaagg 540 
gacatcgagc ataagaagcg gaaagctaag 600 
gacatctggt ttgacgacgt ggacccagcg 660 
gccaagatag cgaggaaaca gttgggtcag 720 
aaagagcagg ccttcggcgg cctgacaaga 780 
gtgggcccta agggggagga gagcatggcc 840 
aagtgcgttt atgacaagta cgtcaaacca 900 
gtcagtg i gga ttcggcctga gaacctcaag 960 
gaagtggcag agatgctgaa v *gggcagaatt 1020 
aaggtactat ttcttgatca tccaaaaaag 1080 
ttcaagagtc aagtaaagag tggaaggccg 1140 
gggctccagg tccagcaggc ggagcactgt 1200 
ctgtacgtca tggtgaagaa ggagtgggag 12 60 
actgctccag accactgcag tgacgacgcc 1320 
ccgctacaga ggcaatgtga ccagtcacag 13 80 
tctggtgaaa ccttttcaga atcatggcag 1440 
ctccttcgtg ttgacga 14 87 



<210> 104 

<211> 2257 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc.feature 

<223> Incyte ID No: 5040573CB1 



<400> 104 

gcgccggctc 

aaagggctcc 

agcatggctg 

accaatctct 

tgcccagcta 

aacccttgat 

acagaatttt 

ctttgctgtg 

ttggcctgaa 

caaagccatg 

caaattacga 

gccggaagtc 

ggcctgggcg 

gctgaaggta 

agtagaacgc 

ctttgtgatg 

agaagcagag 

ctccagccgg 

ttatcctctg 

ggaaagggac 

gctgctgtgg 

ggtgcaccac 

aaacaacccc 

ctacctccca 

ggtcctcaca 

gctcggaaag 



cggctgcagt 

cgcaccgccc 

ttataaatga 

ctgtctttaa 

agacctgaat 

ggatttgttg 

catccattaa 

cttctatttt 

gtgaaaacca 

tttttggctg 

agggaatggc 

gtcttcatcc 

gatgatgtgg 

gttgctggaa 

tttgagaaag 

gtcaacagcg 

ctcattgaag 

tgtggacctg 

tatcggagaa 

atcccattta 

tggctccagc 

gggggccgag 

agtttcatca 

cgtgaggatg 

ctcactcact 

cgtaagacaa 



tcccgggtcc 

ggtgctcccc 

attctgattt 

aagatgaggt 

gccatcaccc 

ttgcttgaga 

agaggaagag 

gtgaattttt 

cagcctctga 

acacccattt 

agatggagag 

tgggggatat 

agcggtttca 

accatgacat 

tgtticagctc 

tggcgctgaa 

tttctcacag 

ggcctctgct 

gtgatgctaa 

aggagaacta 

cgcgcctggt 

tccccgagct 

tgggtagcat 

tggttttgat 

ttgggcttct 

gatgaagagc 



ctcggccacc 

atctgcctgg 

tggggagcag 

gacttggtga 

tccccagggc 

aatggcgatg 

ttcattgctg 

aacctattac 

tggtgaacag 

gcttggggaa 

agcgttccag 

ctttgatgaa 

gaaaatgttc 

tggcttccat 

tgaaagactg 

cggggatggc 

actgaactgc 

gcccacgtct 

ctgttctggg 

tgacgtgctt 

tctcagtggc 

cagcgtccca 

cacgcccaca 

catctactgt 

agcctcacct 

aggcgccatt 



gaagccaccc 

cgttgtgcgc 

atgccaactt 

ttttcctgga 

tctgcagttt 

atcgaattgg 

ttgaaactca 

ttagcgatct 

accacacgtg 

ttcctaggcc 

acagctctgt 

gggaagtgga 

agacacccaa 

tatgagatga 

ttttcttgga 

tgtggcatct 

tcccgagagc 

gcccctgtcc 

gaagacgctg 

tcacgggagg 

cacacgcaca 

tctttcagtt 

gactacaccc 

ggagtggtgg 

tttctttctg 

ataaatatca 



tgccctggtg 

agagctggaa 

agagcctcgt 

aaattatagg 

tctcgtggtg 

ggtttggaag 

tagctgttgt 

ttcagtgtaa 

agcctgtgct 

actggctgga 

ggttgctgca 

gcacccctga 

gtcatgtaca 

acacatacaa 

aaggcattaa 

gctctgaaac 

aggcacgtgg 

tcctgcagca 

ctcctccaga 

catcacaaaa 

gcgcctgcga 

ggaggaacag 

tctccaagtg 

gcttccttgt 

gtttgaactt 

aagcccaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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aatggaactt 

cgtctctctg- 

gttgattgta 

ctcattttta 

ttaatgtcgt 

cagcacacac 

gtgaagtggt 

gccgggcaca 

cacttgaggt 

gaaatacaaa 

caggagaatc . 

actctagcct . 

<210> 105 x 

«211> 2550 

<212> DNA 

*<213> Homo 



tgggcagaga 
cacagcacag 
ctgttctcat 
tcaaatatat 
cactcacatg 
ttgggattat 
tctcaaattp 
gtggctcaca 
cag^agttca 
aattagccaa 
ccttgagccc 
gggcaacaga 



sapiens 



tcatgttaga 
aaattctcaa 
gctataaaag 
gtatcatcaa 
cacgacagtc 
gtgtatacat 
c'tatgtactg 
cctgtaatcc 
agaccagcct 
gcatggtagc 
gggaggcgga 
gtgagtccgt 



atcaagtg.ga 
tcactgaaat 
tggacaggta 
aggttgcatc 
cttgttcccc 
aaataaatat 
taaagctgta- 
cagcactttg 
ggccaabatg 
aag'tgcctat 
ggttgcagtg 
ctcaaac 



.tgatgagacc 
gagtaactgc 
ctctacaaca 
tgtacagtat 
caggaagggc 
tgggctgttt 
cccttaaaag 
ggaggctgag 
gtgaaacctc 
aataccagct 
agccaagatc 



aattacaggc 
aaaataaata 
aatctgtttt: 
gtaaatgcta 
ctggtggccc 
ccctcttcct* 
tacagatgtg 
gcgggtggat: 
gtctccgcta 
gaggctgagg 
atgccactgc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2J60 
2220 
2257 . 



<220> 

<221>. misc_feature 
<223* Incyte ID No: 



5627029CB1 



<400> 105 

cggaagtatt cccattttgc gttgtctggg 
gccactatgg ggtcgtccaa gaagcatcgc 
-gcggccggca ccgggggtgc caccgagcag 
aagcaccgga gtggcggcag tggcggtagc 
cgtgggggcg agcgcgggag cgggcggcgc 
cacgggcggg agcgcagcca ggcagagccc 
gatgacggct acgaggccgc tgccagctcc 
agcatcgagg agactaacaa actccgggca 
gccatcaaga aggaggcggg caccaaggag 
atggccttgc gacagcgaga ggagctgcgg 
ctgctgaacc aaaagctggg gaagataaag 
gacactgcag cctggatcga gaggagccgg 
aagagggcca agttactgga ggagatggac 
gaggagttcg ggcagaggcg gcaggacctg 
gtggagcatg ccattgattc cttccgagaa 
aaaggcgtgc tgcaggagga ggaggacgtg 
cgggcagaga aaaatgtgga gctgcggaag 
gacgagagcg tggacgacct ggcgcagcaa 
gaagagcttg aaggggagcg gccacattcc 
. ggcctgcggg agcgggagct ggaggagatc 
ctgagcacag tggggccccg gctggcctcc 
tttaaaaaga ccaagcggag ggtgaagaaa 
cgggcagatg acttgctgcc tctcggggac 
ctgcggggac ggggtcgccg ccgagtgtcc 
cagcccctgc cgtcggacga cacccgagtg 
ggtggagctc caccgccggc gtccccgcag 
ctgcagaagc agctggagaa gggacgccgg 
cgagacagtg gcgagaaggt ggtggagatt 
tgggaggagg atgaggatcc cgagcggaag 
ttctgccgca ccttggggga gatccccacc 
gaggagctca tggactttga acgggatgag 
gacggggagg agaacatcgg ctggagcacg 
gatttctctg cttcctccac caccatcctg 
gcagctgccc tgctcctgtg tcagaacaaa 
gcccgggtga aggcccccaa caagtcgctg 
atggccatcg atgacaagta cagccggagg 
aaggagaagg acggctacaa acccgacgtt 
aaactcacac ccaaggaggc tttccggcag 
ggcaagatga agacagagcg gcggatgaag 
atgagctcca gcgacacgcc cctgggcacc 



ctcggcggca gccgggctcg gagtggacgt 60 
ggagagaagg aggcggccgg gacgacggcg 12 0 
ccgccgcggc accgggaaca caaaaaacac 180 
ggtggcgaac gacggaagcg gagccgggaa 24 0 
ggggccgaag ctgaggcccg gagcagcacg 300 
tccgagcggc gcgtgaagcg ggagaagcgc 3 60 
aaaactagct caggcgatgc ctcctcactc 420 
aagttggggc tgaaaccctt ggaggttaat 480 
gagcccgtga cagctgatgt catcaaccct 540 
gagaagctgg cggctgccaa ggagaagcgc 600 
accctaggag aggatgaccc ctggctggac 660 
cagctgcaga aggagaagga cctggcagag 72 0 
caagagtttg gtgtcagcac tctggtggag 780 
tacagtgccc gggacctgca gggcctcact 840 
ggggagacaa tgattcttac cctcaaggac 900 
ctggtgaacg tgaacctggt ggataaggag 960 
aagaagcctg actacctgcc ctatgccgag 1020 
aaacctcgct ctatcctgtc caagtatgac 1080 
ttccgcttgg agcagggcgg cacggctgat 1140 
cgggccaagc tgcggctgca ggctcagtcc 1200 
gaatacctca cgcctgagga gatggtgacc 1260 
atccgcaaga aggagaagga ggtagtagtg 1320 
cagactcagg atggggactt tggttccaga 1380 
gaagtggagg aggagaagga gcctgtgcct 1440 
gagaacatgg acatcagtga tgaggaggaa 1500 
gtgctggagg aggacgaggc ggagctggag 1560 
ctgcgacagt tacagcagct acagcagctg 1620 
gtgaagaagc tggagtctcg ccagcggggc 1680 
ggggccatcg tgttcaacgc cacgtccgag 1740 
tacgggctgg ctggcaatcg cgaggagcag 1800 
gagcgctcag ccaacggtgg ctccgaatct 1860 
gtgaacctgg acgaggagaa gcagcagcag 1920 
gacgaggaac cgatcgtgaa tagggggctg 1980 
gggctgccgg agaccacagt gcagaaggtg 2040 
ccctcagccg tgtactgcat cgaggataag 2100 
gaggaatacc gaggcttcac acaggacttc 2160 
aagatcgaat acgtggatga gacgggccgg 2220 
ctgtcgcacc gcttccatgg caagggctca 2280 
aagctggacg aggaggcgct cctgaagaag 2340 
gtggccctgc tccaggagaa gcagaaggct 2400 
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cagaagaccc cctacatcgt gctcagcggc agcggcaaga gcatgaacgc gaacaccatc 2460 
accaagtgac agcgccctcc cgccccggcc ctgcctcaac cttcatatca aataaagctc 2520 
cctccttatt ttttcaaaaa aaaaaaaaaa * 2550 



<210> 106 
<211> 2566 
<212> DNA . 
<213> Homo sapiens 

<220> 1 

<221> .misc.feature 

<223> : Incyte ID No : 5678487CB1 - v 



<400> 106 
cggctcgagg , 
gcccccaaga 
gcgggcgctc 
'cacgtaactc. 
ccctggagga 
ttcaccaact 
atctttcttt 
aacagagcca 
gtgaagtatc 
aaaccaatat 
ttcatcattt 
aaatacaagt 
aagaggagac 
ggattacaac 
acaatgcaga 
aacttactta 
acttagctga 
ttcttctggc 
atgaagctgt 
ttaaagaaag 
cgcagcaaag 
aagccaattt 
agaggcaggc 
agaaagacat 
tgatcttgtc 
agatttcagc 
atgtcacaaa 
atgaggcaga 
aagatcagga 
aggagcagca 
gcgaatctga 
ttagcaagga 
agaagataca 
tgacctccca 
aactggaaca 
ggtatacata 
gtgtccagca 
acagccaggc 
tggagaggca 
cacaatctga 
ttacattcat 
accgcctcag 
tcgggctgga 



tgaggactac 

gctctgtgcg 

cccggccaca 

ccagctctgg 

gattctgggg 

gacatgtgcc 

tagcaccagt 

tgctgttaat 1 

tccgtcaaga 

gcctactatg 

agaggcagaa 

agttgtgctt 

actgagggaa 

aggcgaagat 

ttttggcaaa 

tgaggaaaag 

atatcagaga 

tgctaatact 

tctttcccaa 

agatgacttg 

agaagcaagt 

tgaaaaaacc 

ggagcgactt 

gatgaaaaag 

tcagaatatt 

tattaatcaa 

ggtgtgtgga 

aaaggagcac 

aatagagaaa 

gaaggcagcc 

gcaccaactg 

agcaaaggcc 

gcaaatggaa 

gaatacattt 

aatctctcaa 

tgataaattg 

tgggagagta 

cacagcccag 

gagcctgtcg 

ttgctgacct 

ctggtttaga 

cctgcagtgg 

gctggagtct 



aactcccgac 
'ggattctagg 
ggcctgttgt 
gcctagagtg 
cagtatcaac 
ctgaaagaag 
gtgggaaatg 
cagctcaaag 
agaagaaaaa 
cacgaccttg 
gttaagttct 
gaaaacgaag 
caaacacttc 
tctggggtgg 
gctgcatctg 
tgtgaaattg 
acttgtgaag 
tgtaaccgtg 
acccatacta 
atgtctgcac 
gcttatgaac 
aaggctttaa 
gaaaaagatc 
gaaataacaa 
gcccaactgg 
ctggaggaaa 
gaaatgcgct 
agagagttca 
ttgagaatag 
ctggccagag 
cacctcacca 
caagcccttc 
gcccagcatg 
ttgacaaagt 
aaaaccagat 
ggaaagttac 
catgagacga 
cagctggtgc 
gaagaggtgg 
ggatggaaca 
cttaatatgc 
ggctggtcct 
gactctagct 



gtgcaaagcg 
ctcccctgtg 
tctcggaagg 
cgtgcatggc 
ggagtctccg 
gcgatgtcac 
aggacgccag 
atttgttgcg 
tgtccccctt 
ttcatactat 
gcaaggagga 
ggctccagca 
tggatgcatc 
gcgaaacctc 
ctggtgagca 
aggaatccca 
atcttaaaga 
ttggtggtct 
atgttcatat 
tagtttccgt 
aggtgaaaca 
tccagtgtga 
ttgcatctca 
aagaaaggga 
aggcccaggt 
ttcaaagcca 
atcagctgaa 
gagcaaaaac 
aactggatga 
aggagtgcct 
gacaggaaaa 
aggcccagca 
acaaaactga 
taaaggaaga 
ctgaaatagc 
agagaagaaa 
tgaagcaaag 
agctcctcag 
accggctgcg 
gagtgaaata 
cacaacgcac 
catcaacgcg 
gagcagagct 



agggccagtg 

acagccgcgg 

gagaaagctg 

ga 5 agtccccg 

ggaacatgcc 

tattggagaa- 

gacagcctgg 

ccaacaagca 

gaggtcatta 

taatgaccag 

actctctgga 

acagctaaaa 

cggaaacatg 

caaaagacca 

gctagaactg 

attgaagttt 

gcaactaaag 

ttgtttgaaa 

gcagaccatc 

aaggagcagc 

agttttgcaa 

ccagttgagg 

gcaagagaaa 

gtacatggga 

ggaaaaggtt 

gctggcttct 

taaaaccaac 

taacagggat 

aagcaaacaa 

gagactaaca 

agatagcatt 

aagagagcag 

aaatgaacag 

atgctgtaca 

tcaactcagt 

tgaagaattg 

gctaaggcag 

caagcagaac 

gacccagtta 

aatgatttac 

cacgaccttc 

ggcgctgtcc 

cctggt , 



ggtgggaaga 
caggaagcag 
gacatttccc 
gagaactcta 
agcagaagca 
gatgcaccaa 
cccgaattac 
gataaggaaa 
gaacatgagg 
tctcaatata 
atgaaaaata 
tctcaaagac 
cacaattctt 
ttttcccatg 
gagaagctaa 
ttgaggaacg 
cataaagaat 
tgtgctcagc 
gaaagactgg 
ttggcagata 
atatctgagg 
aaggagctgg 
agggccattg 
tcaaagatgt 
acaaaggaaa 
cgggaaatgg 
atggagaagg 
cttgaaatta 
cacttggaac 
gaactgctgg 
cagcagagct 
gagctgacac 
tatttgttgc 
ttagccaaga 
caagaaaaaa 
gaggaacagt 
ctggataagc 
cagcttct.cc 
cccagcatgc 
aaagagatat 
ccagggtgac 
ccgcacgcag 



60 ' 

120 

180 

240 

3 00 

360 

420 

480 

5%0 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320, 

1380' 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2566 



<210> 107 
<211> 3022 
<212> DNA 

<213> Homo sapiens 
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-<220> 

<221> misc_f eature 

<223> Incyte lb No: 5682976CB1 



<40a> .107 

gctttcctta 

atgacacttt 

gtctqggggt 

ggaggacccg 

aaaacttgaa 

cactggagaa 

cagcagaaag 

gttcttacct 

ttataccaac 

tggaactact 

tgaagatgga 

ttgtaaagat 

accaatacca 

gcgaatgctg 

tgcccgtttt 

cagtgaagat 

cccgaaagat 

gttgcgacaa 

cagagcaagg 

gatgcagaga 

aagcaataga 

aactcttcct 

agatactcca 

tcattcgaca 

cagtgaacaa 

tccttcttct 

caataatgaa 

cagcacagaa 

tatagcatca 

tttcgtccca 

agaatcatca 

gaaccatatc 

aggagaaaga 

atctgaaaaa 

tgaaaataac 

tgaagaaaca 

agtcctgatc 

ccgtattcag 

tttgaacatt 

aatgataaaa 

ccacattttc 

tcatgtggta 

agattatgac 

tgctgatgaa 

agttccagcc 

gttggagggt 

ataaactctt 

ttttttcttt 

tttttgggat 

taaaacaaaa 

atgcaaagtg 



tttttctaaa 
^aaaagattc 
ggqtcctgcc 
tcccggctgc 
gcaaccctta 
tatatttt-at 
gttttgacaa 
tgtacaaa-tg 
gttgagcaag 
tatgagatta 
actgttaggt 
gatattttaa 
'tattaccttg 
ggcacaagag 
attccttccc 
g : gtcaagaga 
gatacagcac 
ccaccagtta 
ccggagagtg 
atgtctgata 
ggacgaggaa 
acggtcccat 
gctgaacaat 
tcatctccca 
aggcagtctg 
gtggttaaca 
aagctgagcc 
ggaacaacta 
agttctagag 
cagagctcag 
gaggatgtga 
aatataacac 
aatgacctca 
gccaaggaac 
accaatcctg 
tccaccaggg 
ccaggtgcaa 
gagttcttca 
agaaggccgc 
gaagccaatt 
atctgggatc 
aactgcctgc 
ataaagatct 
gttataactc 
tctttcatgt 
gacagatcag 
tttggcaagc 
acagagcttt 
aacctaacat 
ctagcagaat 
caatattttc 



tgttctataa 
iaggtagggtc 
cacacctgtt 
ggagtcgcta 
atgtgcatga 
ctggctcaga 
caattcgttc 
ataaacagat 
atgcagaaac 
tgactgtacc 
ggtttgatac 
ttaactgtcg 
ctgttggttg 
ctacagggaa 
atcttaataa 
ttctcgttag 
gagaacttaa 
agcgtttgag 
aacgagaacg 
tgttatcaag 
gatctcgacc 
caagtcctga 
ttcttcagcc 
cagaaagccc 
ttgaggcatc 
aacagctcgg 
ccaaaccagg 
caagcacaat 
gaattgggag 
tgcaaccacc 
caaaatatca 
aatcagataa 
atcttgatcg 
cagaaacttc 
agcctcagtt 
actctgctct 
ggtatcgagc 
gacggagaaa 
tagtaaaaat 
tctggggtgc 
ggcacactgc 
agccacatcc 
ggtcaccatt 
gaaacgaact 
tgaggatgtt 
aaggctctgg 
acttaaatgt 
agtgcaattt 
tggtttggaa 
gtttttaaaa 
cc 



tgatatiraag 

tcccctccca . 

gtgggacgtg 

cctgggaaga 

tggttgtgtt 

tgacaccaaa 

agggcaccga 

tgtatcctgo' 

caacagacaa- 

caatgaccct; 

acgcatcaaa 

acptgctgcc 

ttctgacagc _ r 

ttatgcaggt *' 

taagtcfctgc 

ttactcttca 

aactccttct 

acttcgtggt 

agatggagag 

atggtttgaa 

cagaggtgga 

tttggaagtg 

ttctacatcc 

tcattctact 

tggacaccac 

atccatgtca 

gacaggtgaa 

aaaactgaac 

ccattgcaaa 

agaaggagac 

ggaaggagta 

gttcacagcc 

ctcttgtggg 

agatcagact 

ccaaacagaa 

tcaggacaca 

aggacctggt 

agaaaggaaa 

ggtttataaa 

taactttgta 

tgagcatttg 

gtttgaccca 

agaagagtca 

catgctggaa 

ggcttcactt 

tcaagagaat 

tctgaaattt 

taaggttatg 

tgattgtgtg 

ctttttgccg 



actatagaac 
cccggctcag 
aggaaaaggt 
agagaattta 
aatacaatct 
ttagtaatta 
gcaaacatat 
tfctggagatg 
tgccaattta 
tacacttttc 
actagctgca 
acgtctgttg 
tcagtacgaa 
cgagggacta 
agagtgacat 
gattacatat 
gcggaagaga 
gattggtcag 
cagagtccca 
gaagcaagtg 
acaagtcaat 
agtgaaactg 
tctacaatgt 
cctttgctat 
acacatcatc 
cttgacgagc 
ccagttttaa 
tttacagatg 
tctgagggtc 
agtgaaacaa 
tctgcagaaa 
aagccattgg 
gttccagaag 
agcactgaga 
gccactgggc 
gatgacagtg 
gatagacgct 
gaaatggaag 
ggccatcgca 
atgagtggtt 
atgcttctgg 
attttagcct 
aggattttta 
gaaactagaa 
aatcatatcc 
gaaaatgagg 
gtataagaca 
gtttttggag 
catgaatttg 
tgtatgagga 



tatctgtttt 
gcagagccat 
ccctcgggct 
tccaaagatt 
gttggaatga 
gtaatcctta 
ttagtgcaaa 
gagtaatatt 
cgtgtcatta 
tctcttgtgg 
caaaagaaga 
ctatttgccc 
Gatatgatcg 
ctggaatggt 
ctctgtgtta 
atccttttqa 
gaagagaaga 
atactggacc 
atgtgtcatt 
aggttgcaca 
cagatatttc 
caatggaagt 
cagctcaggc 
cttctccaga 
agtctgattc 
aacaggataa 
gtttgcacta 
aatggagcag 
aggaggaatc 
aagctcctga 
acccagttga 
attccaactc 
aatctgcttc 
gtgctaccaa 
cttcagctca 
atgatgaccc 
ctgctgttgc 
aattggatac 
actccaggac 
ctgactgtgg 
aagctgataa 
catctggcat 
accgaaaact 
acaccattac 
gagctgaccg 
atgaggaata 
tttattatat 
tttttccctt 
ggagattgta 
gtgctagaaa 



60 
120 
180 
240 
300 
360 
.-420 
480' 
540. 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040' 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3022 



<210> 108 

<211> 2787 

<212> DNA 

<213> Homo sapiens 



<220> 



92/93 



WO 01/07471 



PCT/US00/19948 



<221> misc_feature 

<223> Incyte ID No: 5992432CB1 



<400> 108 r 
gtcgtcgaaa 
aacgggtctg 
tcctgtacct 
ccggaccttg 
?ttctctgtca 
ctcaatcagc 
cgatacctga 
. gaagatgtca 
"~ tggctagatt 
ttcgaccgtg 
ctgcgcttcc 
ctcaagctga 
.ctggatgagg 
gccggcaagt 
gacaaggtac 
gaccagctgg 
gagaaggctc 
acacaggtgt 
accgcctcgg 
cgcttcgagc 
aacccacacc 
gagatcatca 
aagccccaca 
gatgcccgtg 
gcaagcgtgt 
ttgcggctgc 
tcagagcccg 
ctggaggaga 
ctgcgtatcg 
tacttcgagg 
gtgtccgaca 
ctggagcggg 
aagaccttgt 
atggccgtgt 
aacatctaca 
cagaaggcca 
gacatggagt 
cagatctgtg 
cggcatggca 
acgtacaaca 
acgggcaccg 
gaacagcggg 
agcaagatcc 
caggtcaacc 
gagcccaacg 
aaggaagact 
cgtttgtaca 



agaagtcaat 

gctcggaggg 

gggcatccag 

tcttcgagga 

aatgctggct 

tatacgagcg 

aggcgcg.tcg 

acaactgtca 

actgccagtt 

ccctccgggc 

tgcgctcaca 

gtcctgagag 

ccgcccagcg 

ccaactacca 

agtccctcaa 

gcaagctctg 

gggacgtgta 

ttgacagcta 

agctggggcg 

agctcatcag 

acgtgcacga 

acacctacac 

ctctgtgggt 

tcatcctgga 

ggtgtcagtg 

tgcgaaaggc 

tgcagaaccg 

gcctcggcac 

caacacccca 

agagcttcaa 

tctggagcac 

cacgggacct 

acctgctgta 

acgagcgtgc 

tcaagcgggc 

ttgaggtgct 

gcaagctcgg 

acccccggac 

atgaggacac 

cgcaggtcaa 

tgtctgacct 

cagagcagct 

tgttcgtgag 

ccgaggagat 

aggttcggct 

gacccgtccc 

tcaaaaaaaa 



aacgtgggcc 

gcgggccgtc 

aaaaatggtg 

agaggacctc 

tcgctacatc 

ggcactcaag 

ggcacaggtg 

tgagagggcc 

cctcatggac 

actgcccatc 

cccactgcct 

tgcagaggag 

cctggccacc 

gctgtggcac 

tgtggacgpc 

gtgttctctc 

cgaggaggcc 

cgcccagttc 

cgaggaggag 

ccggcggccc 

gtggcacaag 

agaggctgtg 

ggcgtttgcc 

gaaggccacc 

cggagagctg 

cacggcgctg 

cgtgtacaag 

cttccagtcc 

gatcgtcatc 

ggcgtacgag 

ctacctgacc 

gtttgaacag 

cgcacagctg 

caccagggcc 

ggccgagatc 

gtcggacgag 

ggagattgac 

gaccggcgcg 

catcaaggaa 

cttcatggcc 

ggcccctggg 

ggcggctgag 

gagtgacgcc 

ccagctgggc 

ggagcagcag 

tcccccctcc 

aaaaaaa 



tgtccgtcaa 

agtggtagac' 

gtgatggcgc 

ccctatgagg v 

gagttcaaac 

ctgctgccct 

aagcatcgct 

tttgcgttca 

caggggcgcg 

acgcagcact 

gagacagctg 

tacattgagt 

gtggtgaacg 

gagctgtgcg 

atcatccgcg 

gccgactact 

atccggacag, 

gaggagagca 

gatgatgtgg 

ctgctcctca 

cgtgtcgccc 

cagacggtgg 

aagttttatg 

aaggtgaact 

gagctccgac 

cctgcccgcc 

tcactgaagg 

accaaggccg 

aactatgcca 

cgcggcatct 

aaattcattg 

gctctggacg 

gaggaggagt 

gtggagcccg 

tatggggtca 

cacgcgcgtg 

cgcgcccggg 

ttctggcaga 

atgctgcgta 

tcgcagatgc 

cagagtggca 

gcggagcgtg 

tcccgggagg 

gaggacgagg 

agcgtgccag 

cccctcccca 



aaatgattta 

gtcataagcg 

gactttcgcg 

aggaaatcat 

agggcgcccc 

gcagctacaaL 

gtgtgaccga 

tgcacaagat 

tcacacacac 

ctcgaatttg 

tgcgaggcta 

acctcaagtc 

acgagcgttt 

acctcatctc 

ggggcctcac 

acatccgcag 

tgatgaccgt 

tgatcgctgc 

acctggagct 

acagcgtctt 

tgcaccaggg 

accccttcaa 

aggacaacgg 

tcaagcaggt 

acgagaacta 

gggccgagta 

tctggtccat 

tgtacgaccg 

tgttcctgga 

cgctgttcaa 

cccgctatgg 

gctgcccccc 

ggggcctggc 

cccagcagta 

cccacacccg 

agatgtgcct 

ccatctacag 

cgtggaagga 

tccggcgcag 

tcaaggtctc 

tggacgacat 

accagccctt 

agctggcaga 

acgaggacga 

ccgcagtgtt 

ccccctcccc 



accaatagaa 

cgcgactctc 

gcccgagcgg 

gcggaaccaa 

gaagcccagg 

actctggtac 

ccctgcctat' 

gcctqgtctg 

ccgccgcacc 

gcccctgtat 

tcggcgcttc 

aagtgaccgg 

cgtgtctaag 

ccagaatccg 

ccgcttcacc 

cggccatttc 

gcgggacttc 

aaagatggag 

gcgcctggcc 

gctgcgccaa 

ccgcccccgg 

ggccacaggc 

acagctggac 

ggatgacclig 

cgatgaggcc 

ctttgatggt 

gctcgccgac 

catcctggac 

ggagcacaag 

gtggcccaac 

gggccgcaag 

aaaatatgcc 

ccggcatgcc 

tgacatgttc 

cggcatctac 

gcggtttgca 

cttctgctcc 

ctttgaggtc 

cgtgcaggcc 

gggcagtgcc 

gaagctgctg 

gcgcgcccag 

gctggcacag 

gatggacctg 

tgggagcctg 

aatacagcta 



60 

120 

180 

240. 

300 

360 

42 0 

480 

540 

600 

660 

720 

780 

840 

900 : 

960* 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2787 



93/93 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) WorJd Intellectual Property Organization 

lntcrnaiional Bureau 

(43) International Publication Date 
1 February 2001 (f)1.02.2001) 




PCT 



(10) International Publication Number 

WO 01/07471 A3 



(51) International Patent Classification 7 : • CI2N 15/12. 

■! 5/10, C07K 14/47, 16/18. C12Q 1/68. A61K 38/17. G0IN- 
33/50. AO IK 67/027 

(21) International Application Number: PCT/US0O/ 19948 

(22) International Fiiin« Date: 21 July 2000 (21.07.2000) 

(25) Filing Language: * * English 

(26) Publication Language: English 



Canyon Drive, -Castro Valley,, CA 94552 (US). YANG, 
Junriiing |CN/US]; 7125 Bark Lane. San Jose. CA 95 129 
(US). LU, Dyung, Aina, M [US/US]; 233 Coy Drjve, Sun 
Jose, C A 95123 (US). BAUGHN, Mariali, R. |US/US1: 
14244 Santiago Road, San Leandro. CA 94577 (US). 
PATTERSON, Chandra [US/US): 490 Sherwood Way 
#1, Menlo Park, CA 9*025 (US). SHAH; Purvi jlN/USl; 
1608 Queen Charlotte Drive #5, Sunnyvale, CA 94087 
(US). . < '' ' 



(74) Agents: HAMLET-COX, Diana et al.: Incyte Genomics. 
: : Inc.. 3 1 60 Porter Drive, Palo Alio, C A 94304 (US). 



(30) Priority Data: 

60/145*075 
60/153.129 
60/164.647 



21 July 1999 (21.07.1999) US 
S Sepiember 1 999 (08.09. 1999) US 
1 0 November 1 999 (10. 1 1.1999) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier applications: 

US 60/145,075 (CIP) 

Filed on 21 July 1999 (21.07.1999) 

US 60/153.129 (CIP) 

Filed on 8 Sepiember 1999 (08.09. 1 999) 

US 60/164,647 (CIP) 

Filed on 1 0 November 1 999 ( 1 0. 1 1 . 1 999) 

(71) Applicant (for all designated States except US) : I N C YTE 
GENOMICS, INC. {US/US |; 3160 Porter Drive, Palo 
Alio, CA 94304 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): HILLMAN, Jen- 
nifer, L. |US/USj: 230 Monroe Drive #17, Moumain 
i View, CA 94040 (US). LA L, Precti fIN/USI; 2382 Lass 

I Drive. Sania Clara, CA 95054 (US). TANG, Y-, Tom 

I ICN/USj; 4230 Ranwick Court. San Jose, CA 951 18 (US). 

| YUE, Henry [US/US]; 826 Lois Avenue. Sunnyvale, CA 

; 94087 (US). AU -YOUNG, Janice IUS/US1; 233 Golden 

! Eagle Lane. Brisbane. CA 94005 (US). BAN DM AN, 

Olga [US/US]: 366 Anna Avenue, Mountain View. CA 
94043 (US). AZIMZAI, Yalda | US/US]; 5518 Boulder 



(81) Designated States (national): AE. AL. AM. AT, AU, AZ. 
BA, BB. BG, BR. BY. CA. CH. CN, CR. CU. CZ, DE, OK. 
DM. EE, ES, FL GB, GI), GE. CH. GM. !IR, HU. ID. 1L. 
IN. IS, JP, KE, KG, KP. KR. KZ. LC. LK. LR. LS. LT. LU, 
LV. MA, MD, MG, MK, MN. MVV. MX. NO. NZ. PL. PT. 
RO. RU, SD, SE. SG. SI. SK. SL, TJ. TM.TR. TT. TZ, U A. 
UG, US, UZ. VN, YU, ZA. ZW. 

(84) Designated States (regional)'. AR1PO patent (GH. GM, 
KE. LS. MW, MZ. SD, SL. SZ. TZ. UG. ZW). Eurasian 
patent (AM, AZ, BY. KG, KZ. MD. RU, TJ. TM ), European 
patent (AT, BE. CH. CY, DE. DK. ES, FL FR. GB. GR. IE. 
IT. LU, MC, NL, PT. SE). OAPI patent (BF. BJ, CF. CC, 
CI. CM. G A, GN. GW, ML. MR, NE, SN, TD, TG). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

17 January 2002 

(15) Information about Correction: 
Previous Correction: 

see PCT Gazette No. 20/2001 of 17 May 2001, Section II 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCI' Gazette. 



< 



(54) Title: CELL CYCLE AND PROLIFERATION PROTEINS 

(57) Abstract: The invention provides human cell cycle and proliferation proteins (CCYPR) and polynucleotides which identify 
and encode CCYPR. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention 
also provides methods for diagnosing, treating, or preventing disorders associated with expression of CCYPR. 



INTERNATIONAL SEARCH REPORT 



Intt 



j| Application No 

PCT/US 00/19948 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/12 C12N5'/10 
A61K38/17 



C07K14/47 C07K16/18 C12Q1/68 
G01N33/5G A01K67/027 ^x/oa 



According to International Patent Classification <fPC) or to both national classification and IPC 



B. FIELDS SEARCHED * <' ' 

Minimum documentation searched (classification system followed by classification symbols* 

IPC 7 C12N C07K C12Q A61K ; G01N A01K 



searched other than minimum documentation to the-extent that such documents are included in trie fields searched 



base consulted during the, international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to cfaim No. 



NEM0T0 Y ET AL: "Recruitment of an 
alternatively spliced form of synaptojanin 
2 to mitochondria by the interaction with 
the PDZ domain of a mitochondrial outer 
membrane protein" 
EMBO JOURNAL. , 

vol. 18, no. 11, 1 June 1999 (1999-06-01), 
pages 2991-3006, XP002156389 
Rat 0MP25: 88.966% identity in 145 aa 
overlap with SeqldNo.l / 75.835% identity 
in 1167 nt overlap with SeqIdNo.55 

WO 98 45436 A (GENETICS INST) 
15 October 1998 (1998-10-15) 
SeqIdNo.1414: 99.8% identity in 432 bp 
overlap with SeqIdNo.55 

-/-- 



1,3,6,7, 
9-11,13, 
15,19, 
22,25,26 



3,11,12 



| X 1 P urtn © r documents are listed in the continuation of box C. 



1 Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim (s) or 
■ which is cited to establish the publication date of another 
citation or other special reason (as specified) 
"O" document referring to an oral disclosure, use. exhibition or 
other means 

*P" document published prior to the international filing date but 
later than the priority date claimed 



|X 1 Patent family members are listed in annex. 



^ ^nSX"!?? P" b i isned aft er the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
i nve ntton 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken atone 

"V document of particular relevance: the cfaimed invention 

cannot be considered to involve an inventive step when the 
aocument is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent famify 



Date of the actual completion of the international search 



4 January 2G01 



Name and mailing address of the ISA 

European Patent Office. P. B. 5818 Patenilaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70)340-3016 



Date of mailing of the interna rional search report 



75. 04 2001 



Authorized officer 



Lonnoy, 0 



Form PCT/1SA/210 (second sheet) (Juty 19S2) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Intt jl Application No 

PCT/US 00/19948 



C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 | Citation ot document, with indication, where appropriate, oi the relevant passages 



Relevant to daim No. 



EP 1 033 401 A (GENSET) 

6 September 2000 (2000-09-06) 

SeqldNo. 3623: 100.000% identity in 374 nt 

overlap with SeqldNo. 55 

-& DATABASE GENESEQ [Online] 

E.B.I . , Hinxton. U.K. ; 

Accession Number: C03525, 

6 October 2GG0 (2000-10-06) 

DUMAS M ET AL: "Human secreted protein 5 r - 

EST, SeqldNo. 3623" 

XP002156390 

abstract 

WO 97 12962 A (COLD SPRING HARBOR LAB 
-BEACH DAVID- (US); CALIGIURI MAUREEN 
(US);) 10 April 1997 (1997-04-10), 



1,3,6,7, 
9-15 



Form PCT/ISA/210 (continuabon ot second sheet) (July 1992) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT, 



ii nei..dtional application No. 

PCI/US GO/19948 



Box I Obseivations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This Internation^earch Report has not been Bstaolished^in respect of certain claims under Article 1 7(2)<a) for the blowing 

1 - j j Claims Nos.; *' j ) 

. because they relate to subject matter not required to be searched by.this Authority.^arnely: 



reasons: 



Claims Nos.: 



because they relate to parts of the International Application that do not comply with* the orescribed r * 
an extent that no meaningful International Searchxan be carried out, specifically- * P rescn °ed i 

k K . * . 



J requirements to such 



3. | { Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third 



sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



1 " D ^^^i^S^ 0 ^ SCarCh fCeS W6re time,y Paid by ** aPPlfeant. this International 



Search Report covers all 



2 - ^ C0U ' d bS S6arChed With ° Ut eff0rt jUStifying an addilional '-• «- Authority did no, invi.e payment 



3 " I I As onIy some ot the rec * uired additional search fees were timely paid bv the aoolicant thi« into™^;^ i e 

1 * covers only those claims for which fees were paid, specifically claims Nos.: PPl, ° am ' th,S lnlemat,on al Search Report 



4 * LxJ ri 0 /^*!"!^^ « a «=*! ti0 « al .s«ar=^ tees were timely paid by the applicant. Conseauently this International 
restricted to the tnvention first mentioned in the claims; it is covered by claims Nos :' ,nternatl0nal 

see further information sheet invention group 1. 



Search Report is 



Remark on Protest 



I I The addit 'onai search fees were accompanied by the applicant's protest. 
I I No P rotest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998) 



International Application No. PCT/US 00 /19948 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



Patent document 
cited in search report 



kite. . ,al Appficatio n No 

PCT/US 0G/19948 



Publication 
date 



Patent family 
member(s) 



WO 9845436 



15^10-1998 



Publication 
date 



AU 

EP't. 



.6891098 A 



30-10-1998 



tp 


1033401 


A 

, f 


06-09-2000 


NONE 










wo 


9712962 


A 


10-04^1997 

i" 


-US 
' EP 


6001619 A 
0857205 A , 


14- 
12- 


•12- 
■08- 


■1999 
■1998 



Form PCT/lSA/210 (patent family annex) {July 1992) 



